🖼️ image

Image/Manga Translator

Translate text in various pictures with one click

#Open source

#multilingual

#OCR

#Text detection

#machine translation

#image translation

Image/Manga Translator

Product Details

This project can translate text in comics/pictures. Its main functions include text detection, optical character recognition (OCR), machine translation and image repair. It supports multiple languages such as Japanese, Chinese, English and Korean, and can achieve near-perfect translation effects. This project is mainly aimed at comic lovers and image processing workers, who can easily read foreign language comics or perform multi-language processing of images. In addition, it also provides a variety of usage methods such as Web services, online demonstrations, and command line tools, with good usability. The project code is open source, and everyone is welcome to improve and contribute.

Main Features

1

Image text detection

2

Optical character recognition (OCR)

3

machine translation

4

Image patching

5

Web services

6

Command line tools

Target Users

Multi-language translation and image processing of comic/image text

Examples

✓

Translate Japanese comic pictures into Chinese

✓

Language recognition and translation of text in images

✓

Repair the text area in the image and synthesize new translated text

Quick Access

Visit Website →

Categories

🖼️ image

› AI image editing

› AI translation

Related Recommendations

Discover more similar quality AI tools

ComfyUI-Fluxtapoz

ComfyUI-Fluxtapoz

ComfyUI-Fluxtapoz is a collection of nodes designed for Flux to edit images in ComfyUI. It allows users to edit and style images through a series of node operations, and is especially suitable for professionals who need to perform image processing and creative work. This project is currently open source and follows the GPL-3.0 license agreement, which means that users can freely use, modify and distribute the software, but they need to comply with the relevant provisions of the open source license.

Open source image processing

FaceFusion Labs

FaceFusion Labs

FaceFusion Labs is a leading platform focused on facial manipulation, leveraging advanced technology to enable the fusion and manipulation of facial features. The platform’s main advantages include high-precision facial recognition and fusion capabilities, as well as a developer-friendly API interface. FaceFusion Labs background information shows that it made an initial submission on October 15, 2024, and was developed by Henry Ruhs. The product is positioned as an open source project, encouraging community contributions and collaboration.

Artificial Intelligence Open source

DisEnvisioner

DisEnvisioner

DisEnvisioner is an advanced image generation technology that isolates and enhances subject features to generate customized images without tedious adjustments or reliance on multiple reference images. This technology effectively distinguishes and enhances subject features while filtering out irrelevant attributes, achieving superior personalization quality in terms of editability and identity preservation. The research background of DisEnvisioner is based on the current need in the field of image generation for extracting subject features from visual cues. It solves the challenges of existing technologies in this field through innovative methods.

image generation Customization

FacePoke

FacePoke

FacePoke is an AI-powered real-time head and face transformation tool that allows users to manipulate facial features through an intuitive drag-and-drop interface, breathing life into portraits for realistic animations and expressions. FacePoke utilizes advanced AI technology to ensure that all edits maintain a natural and realistic appearance, while automatically adjusting surrounding facial areas to maintain the overall integrity of the image. This tool stands out for its user-friendly interface, real-time editing capabilities, and advanced AI-driven adjustments, making it suitable for users of all skill levels, whether they are professional content creators or beginners.

AI image processing

Pic Pic AI

Pic Pic AI

Pic Pic AI Editor is a powerful AI picture editing tool that provides a variety of functions such as photo enhancement, background removal, object removal, etc., allowing users to easily edit photos at a professional level. This product is based on a user-friendly interface and efficient AI technology, aiming to simplify the image editing process and improve editing efficiency while ensuring the output image quality. Pic Pic AI editor is suitable for users of all levels, whether they are social media users, e-commerce sellers or professional photographers, who can improve their image processing capabilities through this platform.

photo editing image enhancement

photo4you

photo4you

photo4you is an online ID photo production website based on artificial intelligence technology. Users can easily create ID photos without downloading or installing any software. The website supports a variety of standard sizes for official documents such as passports, visas, and driver's licenses. It automatically removes photo backgrounds through intelligent background removal, ensuring that ID photos have a clear, professional look. Users can download the produced ID photos immediately, saving time and trouble. photo4you provides high-resolution output suitable for printing or digital submission.

AI high resolution

PMRF

PMRF

PMRF (Posterior-Mean Rectified Flow) is a newly proposed image restoration algorithm designed to solve the distortion-perceptual quality trade-off problem in image restoration tasks. It proposes a novel image restoration framework by combining posterior mean and correction flow, which can reduce image distortion while ensuring the perceptual quality of the image.

super resolution repair

DepthFlow

DepthFlow

DepthFlow is a highly customizable parallax shader for animating your images. It is a free and open source ImmersityAI alternative capable of converting images into videos with 2.5D parallax effect. This tool has fast rendering capabilities and supports a variety of post-processing effects, such as vignette, depth of field, lens distortion, etc. It supports a variety of parameter adjustments, can create flexible motion effects, and has a variety of built-in preset animations. In addition, it also supports video encoding and export, including H264, HEVC, AV1 and other formats, and provides a user experience without watermarks.

video production image animation

Minionverse

Minionverse

Minionverse is an AI-based creative workflow that generates images by using different nodes and models. This workflow is inspired by an online glif application and provides a video tutorial to guide users on how to use it. It contains a variety of custom nodes that can perform text replacement, conditional loading, image saving and other operations. It is very suitable for users who need to generate and edit images.

AI image generation

PuLID-Flux ComfyUI implementation

PuLID-Flux ComfyUI implementation

PuLID-Flux ComfyUI implementation is an image processing model based on ComfyUI, which uses PuLID technology and Flux model to achieve advanced customization and processing of images. This project was inspired by cubiq/PuLID_ComfyUI and is a prototype that uses some handy model tricks to handle the encoder part. The developers wish to test the quality of the model before re-implementing it more formally. For better results, it is recommended to use the 16-bit or 8-bit version of the GGUF model.

image processing deep learning

Posterior-Mean Rectified Flow

Posterior-Mean Rectified Flow

Posterior-Mean Rectified Flow (PMRF) is a novel image restoration algorithm that minimizes the mean square error (MSE) by optimizing the posterior mean and rectified flow model while ensuring image fidelity. The PMRF algorithm is simple and efficient, and its theoretical basis is to optimize the posterior mean prediction (minimum mean square error estimate) to match the real image distribution. This algorithm performs well in image restoration tasks, can handle various degradation problems such as noise and blur, and has good perceptual quality.

machine learning deep learning

FaceFusion

FaceFusion

FaceFusion is an industry-leading facial manipulation platform specializing in face swapping, lip sync, and deep manipulation technologies. It utilizes advanced artificial intelligence technology to provide users with a highly realistic facial operation experience. FaceFusion has a wide range of applications in image processing and video production, especially in the entertainment and media industries.

AI image processing

Light and shadow magic hand

Light and shadow magic hand

Light and Shadow Magic Hand is a feature-rich image processing software that provides a variety of photo editing tools and AI technology to help users easily edit and beautify photos. The software has a friendly interface, simple operation, supports a variety of image formats, and is suitable for users of all levels.

AI image processing

StableDelight

StableDelight

StableDelight is an advanced model focused on removing specular reflections from textured surfaces. It builds on the success of StableNormal, which focuses on improving the stability of monocular normal estimation. StableDelight solves the challenging task of removing reflections by applying this concept. The training data includes Hypersim, Lumos, and various specular highlight removal datasets from TSHRNet. Furthermore, we integrate multi-scale SSIM loss and stochastic conditional scaling techniques during diffusion training to improve the clarity of one-step diffusion predictions.

image processing computer vision

Colorful Diffuse Intrinsic Image Decomposition

Colorful Diffuse Intrinsic Image Decomposition

Colorful Diffuse Intrinsic Image Decomposition is an image processing technique that decomposes photos taken in the wild into albedo, diffuse shadows, and non-diffuse residual components. This technique enables the estimation of colorful diffuse shadows in images by progressively removing monochromatic lighting and Lambertian world assumptions, including multiple lighting and secondary reflections in the scene, while modeling specular and visible light sources. This technology is important for image editing applications such as specular removal and pixel-level white balancing.

image processing computer vision

A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis

A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis

This is a method of creating re-illuminable radiation fields by leveraging priors extracted from 2D image diffusion models. This method is able to convert multi-view data captured under single illumination conditions into a dataset with multiple illumination effects and represent the re-illuminable radiation field through 3D Gaussian splats. This method does not rely on precise geometry and surface normals, and is therefore more suitable for handling cluttered scenes with complex geometries and reflective BRDFs.

image processing 3D modeling

Browse More Tools