Text to Image
Discover and compare the best AI models for text to image generation. Note: This is my personal informal leaderboard. Models are ranked by the completion rate of a series of diverse prompts designed to thoroughly assess performance.
| Rank | Company | Model | Score |
|---|---|---|---|
OpenAI | GPT Image 2.0 | 89 | |
Google | Nano Banana 2 (Gemini 3.1 Flash Image) | 87 | |
Google | 84 | ||
4 | OpenAI | 82 | |
5 | Alibaba | 79 | |
5 | ByteDance Seed | 79 | |
5 | Google | 79 | |
8 | Baidu | ERNIE Image | 78 |
8 | Zhipu AI | 78 | |
8 | ByteDance Seed | 78 | |
11 | Alibaba | 77 | |
11 | Alibaba | 77 | |
11 | Black Forest Labs | 77 | |
11 | OpenAI | GPT-4o | 77 |
11 | Meituan | 77 | |
16 | Google | 76 | |
16 | Black Forest Labs | 76 | |
16 | Black Forest Labs | 76 | |
19 | ByteDance Seed | 75 | |
20 | Reve | Reve Image (Halfmoon) | 70 |
21 | Recraft | 68 | |
22 | Ideogram | 64 | |
23 | HiDream | 63 | |
24 | Black Forest Labs | 62 | |
25 | Black Forest Labs | 56 | |
26 | Midjourney | Midjourney v7 Alpha | 44 |
27 | Stability | 42 |
Full tutorial & review videos
Watch the videos below for comprehensive comparisons and detailed installation guides for select text-to-image models.
Methodology
Models are ranked using a series of prompts involving diverse range of challenging tasks. This includes:
- Prompt adherence and understanding
- Human anatomy
- Generating text
- Diagrams and infographics
- World understanding
- Uncommon poses and expressions
- Spatial understanding
- NSFW capabilities
To prevent manipulation, the prompts are kept confidential and are regularly updated to increase difficulty as models improve. Here is a subset of prompts for your reference:
Johnny Depp, Jackie Chan, Taylor Swift, Tom Hanks, the Rock, Micheal Jackson, Oprah, BLACKPINK, Cristiano Ronaldo, Elon Musk, taking a group photo
Emilia from re:zero, Gojo Satoru, Nezuko, Keroro Gunso, Kenny (from South Park), Bart Simpson, Snow White taking a selfie
A page of a school yearbook with a grid of student photos
11:15 on the clock and a wine glass filled to the top
A pair of spectral tarsiers on a tree. realistic photo
A ballerina in a tutu practices spins in a sunlit studio with mirrored walls and barre equipment, scattered with pointe shoes and sheet music. A rabbit watches from atop a grand piano. Outside the large window, an elephant balances on a circus ball.
A screenshot of a YouTube video search for "funniest cats"
A woman sitting and showing her palms and soles of feet
A multi-panel comic of a man explaining a simple home workout routine. In each panel, he should describe a different exercise or fitness tip
A red Ferrari Portofino M, a white Audi R8, and a blue 1994 Honda Civic in the desert


![FLUX.2 [klein]](https://i.ytimg.com/vi/2OrOufa3eoc/hqdefault.jpg)








![FLUX1.1 [pro] Ultra](https://i.ytimg.com/vi/enOlq9bEtUM/hqdefault.jpg)







![FLUX.1 [dev]](https://i.ytimg.com/vi/K3xJ7GQuHpw/hqdefault.jpg)