QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Benjamin Schneider, Dongfu Jiang, Chao Du, Tianyu Pang, Wenhu Chen
2025-05-23
Summary
This paper talks about QuickVideo, a new system that helps computers understand long videos really fast, so they can analyze and react to what's happening almost in real time.
What's the problem?
Most video analysis systems are too slow to handle long videos quickly, which makes it hard to use them for things like live events, security, or any situation where you need instant feedback.
What's the solution?
The researchers designed QuickVideo with smarter ways to process video, like breaking up the decoding work so it happens at the same time as the computer is figuring out what's in the video, and using memory more efficiently so nothing slows down.
Why it matters?
This matters because it makes it possible to use AI for real-time video understanding in things like sports broadcasts, live surveillance, and interactive entertainment, opening up new possibilities for technology and safety.
Abstract
QuickVideo accelerates long-video understanding by combining a parallelized video decoder, memory-efficient prefilling, and overlapping video decoding with inference, enabling real-time performance.