
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression

Peijie Dong, Zhenheng Tang, Xiang Liu, Lujun Li, Xiaowen Chu, Bo Li

2025-05-28


Summary

This paper introduces ACBench, a new benchmark for testing how compressing large language models affects their ability to act as agents: planning, using tools, understanding long texts, and solving real-world problems.

What's the problem?

The problem is that while making language models smaller and more efficient saves computing resources, most existing tests only check whether compressed models can still handle basic language tasks, not whether they can still do more complex things that require decision-making and multi-step reasoning.

What's the solution?

To solve this, the researchers created ACBench, which tests compressed models on a range of agent-like skills, including generating workflows, using tools, handling long pieces of information, and operating in real-world scenarios. They also introduced new metrics that measure how much of each ability a compressed model keeps compared to the original, uncompressed version.
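The core idea of comparing a compressed model against its original can be sketched as a simple retention score. This is an illustrative example only, not ACBench's actual metrics; the task names and score values below are hypothetical.

```python
# Hypothetical sketch: how much of its agentic ability a compressed model
# retains, expressed as a fraction of the original model's score per task.
# (Illustrative only -- not ACBench's actual metric definitions.)

def retention(original_scores, compressed_scores):
    """Per-task retention: compressed score divided by original score."""
    return {
        task: compressed_scores[task] / original_scores[task]
        for task in original_scores
    }

# Made-up benchmark scores for three agentic skill categories.
original = {"workflow_generation": 0.82, "tool_use": 0.75, "long_context": 0.68}
compressed = {"workflow_generation": 0.74, "tool_use": 0.69, "long_context": 0.51}

per_task = retention(original, compressed)
average_retention = sum(per_task.values()) / len(per_task)
# A low retention on one task (e.g. long_context here) can reveal a
# capability loss that an overall average would hide.
```

A per-task breakdown like this matters because compression can degrade different abilities unevenly, which is exactly the kind of effect the paper sets out to measure.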

Why it matters?

This is important because it helps developers know how much they can shrink these models without losing important abilities, making it possible to use powerful AI in places where computer power is limited, like on personal devices or in robots.

Abstract

ACBench evaluates the impact of compression on the agentic capabilities of large language models, focusing on workflow generation, tool use, long-context understanding, and real-world application.