Distilling LLM Agent into Small Models with Retrieval and Code Tools
Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang
2025-05-26
Summary
This paper introduces a method called Agent Distillation, which helps smaller AI models learn to reason and solve problems almost as well as much bigger, more powerful models by training them on the tool-using behavior of a large teacher model.
What's the problem?
Large language models are very good at complex reasoning and task solving, but they demand a lot of computing power and resources, which puts them out of reach for many users. Smaller models are cheaper to run but are usually far less capable.
What's the solution?
The researchers developed a way to transfer the skills of big models to small ones: the large teacher model solves tasks as an agent, interleaving reasoning steps with tool calls such as document retrieval and code execution, and the small model is trained on these trajectories with improved prompting. This lets the small models learn not just what to answer but how to act, so they come close to the large models' performance on a variety of reasoning challenges.
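To make the idea concrete, here is a minimal sketch of the kind of agent loop a distilled student model would follow. This is an illustration, not the paper's implementation: the names `run_code`, `agent_loop`, and `toy_policy` are hypothetical, and the scripted policy stands in for what would really be a small language model generating each step.

```python
import io
import contextlib

def run_code(snippet: str) -> str:
    """Toy code tool: execute a Python snippet and capture its stdout."""
    buf = io.StringIO()
    with contextlib.redirect_stdout(buf):
        exec(snippet, {})  # sandboxing omitted for brevity
    return buf.getvalue().strip()

def agent_loop(task: str, policy) -> str:
    """Alternate action -> observation steps until the policy answers.

    `policy` maps the interaction history to the next step; in Agent
    Distillation the student model would play this role after being
    trained on the teacher's trajectories.
    """
    history = [("task", task)]
    while True:
        step = policy(history)
        if step["type"] == "answer":
            return step["content"]
        obs = run_code(step["content"])      # call the code tool
        history.append(("action", step["content"]))
        history.append(("observation", obs))  # feed the result back

# A scripted stand-in for the student model, for illustration only:
# it writes one line of code, then reports the tool's output as the answer.
def toy_policy(history):
    if history[-1][0] == "task":
        return {"type": "action", "content": "print(17 * 24)"}
    return {"type": "answer", "content": history[-1][1]}

print(agent_loop("What is 17 * 24?", toy_policy))  # 408
```

The key point the sketch shows is that the agent's answer comes from an executed tool call rather than from the model's memorized arithmetic, which is what makes the behavior learnable by a much smaller model.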
Why it matters?
This is important because it means more people and organizations can use powerful AI without needing expensive hardware, making advanced AI more accessible and practical for everyday use.
Abstract
Agent Distillation transfers the reasoning and task-solving capabilities of large language model agents to smaller models through improved prompting and self-consistent action generation, equipping the small models with retrieval and code tools so that they approach the performance of much larger models on a variety of reasoning tasks.