< Explain other AI papers

Scaling Image and Video Generation via Test-Time Evolutionary Search

Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan

2025-05-26

Scaling Image and Video Generation via Test-Time Evolutionary Search

Summary

This paper talks about EvoSearch, a new method that helps AI models make better images and videos by using an approach inspired by evolution, especially when the model is actually generating the content.

What's the problem?

The problem is that while AI models for creating images and videos have gotten really good, it's still hard to improve their quality and variety without making the models bigger or retraining them, which can be expensive and slow. Existing methods for boosting performance when the model is running often only work for specific tasks, don't scale well, or end up making all the results look too similar.

What's the solution?

The researchers created EvoSearch, which treats the process of generating images and videos like an evolutionary process. It uses ideas like selection and mutation to explore different ways to clean up and improve the images or videos as they're being made. This approach doesn't need extra training or bigger models, and it keeps the results diverse and high quality by constantly refining and selecting the best outputs during generation.

Why it matters?

This is important because it means we can get much better and more creative images and videos from AI without needing more powerful computers or longer training times, making advanced content creation faster, cheaper, and more accessible.

Abstract

EvoSearch, an evolutionary search method, enhances test-time scaling for diffusion and flow-based generative models, improving image and video generation quality, diversity, and generalizability.