Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models
Kiymet Akdemir, Tahira Kazimi, Pinar Yanardag
2025-06-24
Summary
This paper talks about Audit & Repair, a system that uses multiple AI agents working together to create consistent stories in images generated from text, especially when making several panels that tell a story.
What's the problem?
The problem is that when AI models generate multiple images for a story, the images often don't look consistent with each other, like characters or scenes changing strangely between panels.
What's the solution?
The researchers designed a multi-agent framework where different AI models collaborate to check and fix inconsistencies across the story’s images. They use diffusion models like Flux and Stable Diffusion to repair parts of the images that don’t match the story well.
Why it matters?
This matters because it helps make AI-generated story images look more natural and connected, improving the quality and clarity of visual storytelling in comics, animations, and other media.
Abstract
A collaborative multi-agent framework improves consistency in multi-panel story visualizations by refining inconsistencies across panels using diffusion models like Flux and Stable Diffusion.