ConsumerBench: Benchmarking Generative AI Applications on End-User Devices
Yile Gu, Rohan Kadekodi, Hoang Nguyen, Keisuke Kamahori, Yiyu Liu, Baris Kasikci
2025-06-24
Summary
This paper talks about ConsumerBench, a framework designed to test how well generative AI models work on everyday devices like phones and laptops by measuring their speed and efficiency in real-world situations.
What's the problem?
The problem is that generative AI models are often very demanding in terms of computing resources, and their performance on user devices hasn’t been tested thoroughly in realistic, multi-task scenarios where many applications run together.
What's the solution?
The researchers created ConsumerBench, which allows testing AI models under various real-world conditions with multiple applications running at once, tracking important factors like how fast the AI responds, how much computing power it uses, and how well it manages resources on different devices.
Why it matters?
This matters because it helps developers understand and improve how AI systems perform on actual user devices, making AI applications faster, more efficient, and better suited for everyday people.
Abstract
ConsumerBench evaluates GenAI system efficiency and response time on end-user devices through a comprehensive benchmarking framework, emphasizing realistic multi-application scenarios and customizable workflows.