Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Xuyang Liu, Zichen Wen, Shaobo Wang, Junjie Chen, Zhishan Tao, Yubo Wang, Xiangqi Jin, Chang Zou, Yiyu Wang, Chenfei Liao, Xu Zheng, Honggang Chen, Weijia Li, Xuming Hu, Conghui He, Linfeng Zhang

2025-05-27

Summary

This paper argues for a new direction in AI research: making artificial intelligence systems more efficient by compressing the data they process, rather than only shrinking the models themselves.

What's the problem?

When AI models process large amounts of information at once, such as long conversations or lengthy documents, they become slow and demand substantial computing power. Most efficiency efforts so far have focused on shrinking the models themselves, but that alone does not solve the problem, because the cost of handling long inputs grows with the amount of data, not just the size of the model.

What's the solution?

The researchers propose shifting the focus to data-centric compression, in particular compressing tokens, the basic units of information an AI model uses to represent language. By making the input itself more compact, the model can handle longer contexts more efficiently without losing important details.
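To make the idea concrete, here is a minimal sketch of one common form of token compression: scoring tokens by importance and keeping only the top fraction. This is an illustration, not the paper's specific method; the L2 norm of each token's embedding is used as a stand-in importance score, whereas real systems often use signals such as attention weights.

```python
import numpy as np

def compress_tokens(hidden_states, keep_ratio=0.5):
    """Keep only the highest-scoring tokens from a sequence.

    hidden_states: (num_tokens, dim) array of token embeddings.
    keep_ratio: fraction of tokens to retain.

    The L2 norm is a simple stand-in importance score; real token
    compression methods typically derive scores from the model itself.
    """
    num_tokens = hidden_states.shape[0]
    num_keep = max(1, int(num_tokens * keep_ratio))
    scores = np.linalg.norm(hidden_states, axis=1)
    # Select the top-scoring tokens, then restore their original order
    # so the compressed sequence stays temporally coherent.
    keep_idx = np.sort(np.argsort(scores)[-num_keep:])
    return hidden_states[keep_idx], keep_idx

# Example: compress a sequence of 8 tokens down to 4.
tokens = np.random.rand(8, 16)
compressed, kept = compress_tokens(tokens, keep_ratio=0.5)
print(compressed.shape)  # (4, 16)
```

The key design choice is that the model downstream sees a shorter sequence, so every later computation (attention, memory for cached states) gets cheaper, independent of the model's size.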

Why does it matter?

This matters because it could make AI systems faster and capable of much larger and more complex tasks, such as summarizing entire books or sustaining long conversations, while using less energy and fewer computing resources.

Abstract

The focus in AI research is shifting from model-centric to data-centric compression, with token compression identified as key to improving efficiency in handling long-context scenarios.