Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Xiaoyu Tian, Yunjie Ji, Haotian Wang, Shuaiting Chen, Sitong Zhao, Yiping Peng, Han Zhao, Xiangang Li

2025-05-21

Summary

This paper shows that the quality of the answers used to train smaller AI models (student models) matters a great deal, especially when those answers are distilled from more advanced language models.

What's the problem?

The problem is that not all correct answers are equally useful for teaching AI. An answer can be technically right yet show weak or unclear reasoning, and training a student model on such answers leads to a weaker student that does not learn as well.

What's the solution?

To address this, the researchers showed that distilling reasoning data (answers that spell out the step-by-step thought process) from advanced language models makes student models perform much better across a variety of benchmarks, because the students learn not just the final answer but how to think through the problem.
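To make the idea concrete, here is a minimal sketch of how distillation data with reasoning traces differs from answer-only data. The function names and the `<think>…</think>` wrapper are illustrative assumptions, not the paper's actual pipeline; the point is simply that the teacher's reasoning becomes part of the training target.

```python
# Hypothetical sketch of building supervised fine-tuning examples for
# distillation. The key idea is that the teacher's reasoning trace, not
# just its final answer, becomes the training target.

def make_reasoning_example(question, teacher_trace, teacher_answer):
    """Pack the teacher's reasoning trace and final answer into one target.

    The <think> tags are an assumed convention for marking the trace.
    """
    target = f"<think>{teacher_trace}</think>\n{teacher_answer}"
    return {"prompt": question, "completion": target}

def make_answer_only_example(question, teacher_answer):
    """Baseline: train on the bare answer, discarding the reasoning."""
    return {"prompt": question, "completion": teacher_answer}

# The reasoning-rich target carries strictly more signal than the bare answer.
rich = make_reasoning_example(
    "What is 17 * 6?",
    "17 * 6 = 17 * 5 + 17 = 85 + 17 = 102.",
    "102",
)
bare = make_answer_only_example("What is 17 * 6?", "102")
```

Both examples end in the same correct answer; the difference the paper highlights is that only the first teaches the student *how* that answer was reached.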

Why does it matter?

This matters because it means we can build smarter and more reliable AI by carefully choosing the right distillation source, leading to models that can explain their reasoning and make better decisions in real-world situations.

Abstract

Distilling reasoning data from advanced language models improves student model performance across various benchmarks.