DiSA: Diffusion Step Annealing in Autoregressive Image Generation

Qinyu Zhao, Jaskirat Singh, Ming Xu, Akshay Asthana, Stephen Gould, Liang Zheng

2025-05-27

DiSA: Diffusion Step Annealing in Autoregressive Image Generation

Summary

This paper talks about DiSA, a new technique that makes image-generating AI models work faster by smartly cutting down on the number of steps they need to create each part of an image, without losing quality.

What's the problem?

The problem is that autoregressive image generation models, which build images piece by piece, usually take a lot of time and computer power because they use the same number of steps for every part of the image, even when it's not necessary. This makes the whole process slow and less efficient.

What's the solution?

The authors introduced diffusion step annealing, which means the model starts with more steps when generating the first parts of an image but then uses fewer and fewer steps as it fills in the rest. This saves time and resources while still keeping the final image looking good.

Why it matters?

This is important because it allows AI to generate high-quality images much more quickly, making these models more practical for real-world uses like art, design, and media production.

Abstract

Diffusion step annealing enhances inference efficiency in autoregressive models by reducing the number of diffusion steps as more tokens are generated, preserving quality.

View Paper