Text to Video

Discover and compare the best AI models for text to video generation. Note: This is my personal informal leaderboard. Models are ranked by the completion rate of a series of diverse prompts designed to thoroughly assess performance.

Overview Text to Image Image Editing Text to Video Image to Video Text to Speech Full Body Animation Text to Music

Rank	Company	Model	Score
	ByteDance Seed	Seedance 2.0	94
	Alibaba	HappyHorse 1.0	82
	Kuaishou	Kling 3.0	82
4	OpenAI	Sora 2	79
5	Lightricks	LTX-2	75
6	Kuaishou	Kling 2.6	73
7	MiniMax	Hailuo 2.3	71
8	Kuaishou	Kling 01	69
8	PixVerse	Pixverse V5.5	69
10	MiniMax	Hailuo 02	67
11	Google	Veo 3.1	66
11	Tencent	HunyuanVideo 1.5	66
11	Google	Veo 3	66
11	Alibaba	Wan 2.2	66
15	Meituan	LongCat-Video	65
15	ByteDance Seed	Seedance 1.0	65
17	Runway	Runway Gen4.5	64
17	Bytedance	Waver 1.0	64
17	Kuaishou	Kling 2.1	64
17	Luma Labs	Ray 3	64
17	PixVerse	PixVerse V5	64
22	Kuaishou	Kling 2.0	59
23	PixVerse	PixVerse V4.5	56
24	Alibaba	Wan 2.1	54
25	OpenAI	Sora	52
25	KlingAI	Kling 1.6	52
27	Pika Art	Pika 2.0	51
28	Vidu	Vidu Q1	50
28	Tencent	Hunyuan Video	50
30	Genmo	Mochi 1	47
31	Luma Labs	Ray 2	44

Full tutorial & review videos

Watch the videos below for comprehensive comparisons and detailed installation guides for select video generation models.

Seedance 2.0

Kling 3.0

Hunyuan Video installation & review

KLING O1 & 2.6 review

LTX-2 review

Sora 2 review

Veo 3.1 review

Wan 2.2 installation & review

Hailuo 02 review

Veo 3 review

Kling 2.0 review

LTXV 13B installation & review

Hunyuan Video installation & review

Wan VACE installation & review

Wan 2.1 installation & review

Genmo Mochi 1 review

Methodology

Models are ranked using a series of prompts involving diverse range of challenging tasks. This includes:

Prompt adherence and world understanding
Motion and physics
Character consistency
Scene transitions

Camera movement and angles
Lighting and shadows
Text and object generation
NSFW capabilities

To prevent manipulation, the prompts are kept confidential and are regularly updated to increase difficulty as models improve. Here is a subset of prompts for your reference:

A man riding a unicycle and juggling red balls

Will Smith eating spaghetti

A princess wearing a glittery white dress. She is running away from a massive red dragon with glowing red eyes. 3D disney pixar style

A gymnast performs a flip on a balance beam

A professor writes "hello" on the chalkboard

A swarm of zombies causing chaos in a shopping mall, shaky camera

A cat roars while looking at its reflection in the mirror but instead sees itself as a lion roaring

A neon sign with the text "Subscribe to my channel". Cyberpunk city at night.