Qwen-Image: A 20B Parameter Image Foundation Model Released
2025-08-05
Alibaba DAMO Academy released Qwen-Image, a 20-billion parameter image foundation model that significantly advances complex text rendering and precise image editing. It boasts high-fidelity text rendering in multiple languages (including English and Chinese), preserving semantic meaning and visual realism during edits. Qwen-Image outperforms existing models across various benchmarks for image generation and editing. Demonstrations showcased its capabilities: generating images with intricate Chinese typography and layouts, crafting detailed PPT slides, and even handling bilingual text rendering, highlighting its robust text processing and image generation abilities.