Key Features

Supports semantic and appearance editing for comprehensive image manipulation
Precise bilingual text editing in both Chinese and English with original font preservation
Dual encoding: Qwen2.5-VL for semantic control and a VAE for appearance control
Enables creation of original IP content, style transfer, and object rotation
Achieves state-of-the-art performance on multiple image editing benchmarks
Available via Qwen Chat and natively supported in ComfyUI

The model’s architecture pairs two encoders: Qwen2.5-VL for visual semantic control and a Variational AutoEncoder (VAE) for detailed visual appearance control. This dual encoding lets Qwen-Image-Edit balance semantic coherence against visual fidelity, preserving object identity and keeping unmodified regions consistent. As a result, users can perform complex edits, ranging from subtle visual adjustments to significant content transformations, without losing the original image’s contextual meaning or quality.
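
The claim that unmodified regions remain consistent can be spot-checked on any edit result. The sketch below is a hypothetical helper, not part of Qwen-Image-Edit or any of its tooling: it assumes images are plain nested lists of RGB tuples and that you supply your own boolean mask marking where the edit was requested. The names `changed_outside_mask` and `tolerance` are illustrative.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]
Mask = List[List[bool]]  # True where an edit was requested (assumed input)


def changed_outside_mask(
    original: Image, edited: Image, mask: Mask, tolerance: int = 0
) -> List[Tuple[int, int]]:
    """Return (row, col) positions outside the mask whose pixel differs
    from the original by more than `tolerance` in any channel."""
    if not (len(original) == len(edited) == len(mask)):
        raise ValueError("original, edited, and mask must have the same height")

    drifted: List[Tuple[int, int]] = []
    for r, (row_o, row_e, row_m) in enumerate(zip(original, edited, mask)):
        if not (len(row_o) == len(row_e) == len(row_m)):
            raise ValueError(f"row {r} has mismatched widths")
        for c, (pix_o, pix_e, is_edit) in enumerate(zip(row_o, row_e, row_m)):
            if is_edit:
                continue  # changes inside the requested region are expected
            if any(abs(a - b) > tolerance for a, b in zip(pix_o, pix_e)):
                drifted.append((r, c))
    return drifted


if __name__ == "__main__":
    original = [[(10, 10, 10), (20, 20, 20)], [(30, 30, 30), (40, 40, 40)]]
    edited = [[(10, 10, 10), (200, 0, 0)], [(30, 30, 31), (40, 40, 40)]]
    mask = [[False, True], [False, False]]
    print(changed_outside_mask(original, edited, mask))
```

An empty result means every pixel outside the requested region stayed within tolerance. A small non-zero `tolerance` is a reasonable choice when the output has passed through lossy encoding.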


Designed for both professional content creators and general users, Qwen-Image-Edit is available in Qwen Chat through a dedicated 'Image Editing' feature and is also supported natively on platforms such as ComfyUI. It achieves state-of-the-art results on multiple public image editing benchmarks. By combining semantic and appearance editing with precise bilingual text control, Qwen-Image-Edit lowers the barrier to producing high-quality, customized visual content.
