Text to Video

Discover and compare the best AI models for text to video generation. Note: This is my personal informal leaderboard. Models are ranked by the completion rate of a series of diverse prompts designed to thoroughly assess performance.

RankCompanyModelScore
OpenAI
86
Lightricks
82
Kuaishou
80
4
MiniMax
78
5
Kuaishou
76
5
PixVerse
76
7
MiniMax
74
8
Google
Veo 3.1
73
8
Tencent
73
8
Google
73
8
Alibaba
73
12
Meituan
72
12
ByteDance Seed
72
14
Runway
Runway Gen4.5
71
14
Bytedance
71
14
Kuaishou
Kling 2.1
71
14
Luma Labs
71
14
PixVerse
PixVerse V5
71
19
Kuaishou
66
20
PixVerse
63
21
Alibaba
61
22
OpenAI
59
22
KlingAI
59
24
Pika Art
58
25
Vidu
57
25
Tencent
57
27
Genmo
54
28
Luma Labs
51

Methodology

Models are ranked using a series of prompts involving diverse range of challenging tasks. This includes:

  • Prompt adherence and world understanding
  • Motion and physics
  • Character consistency
  • Scene transitions
  • Camera movement and angles
  • Lighting and shadows
  • Text and object generation
  • NSFW capabilities

To prevent manipulation, the prompts are kept confidential and are regularly updated to increase difficulty as models improve. Here is a subset of prompts for your reference:

A man riding a unicycle and juggling red balls
Will Smith eating spaghetti
A princess wearing a glittery white dress. She is running away from a massive red dragon with glowing red eyes. 3D disney pixar style
A gymnast performs a flip on a balance beam
A professor writes "hello" on the chalkboard
A swarm of zombies causing chaos in a shopping mall, shaky camera
A cat roars while looking at its reflection in the mirror but instead sees itself as a lion roaring
A neon sign with the text "Subscribe to my channel". Cyberpunk city at night.