THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Xiao Pu, Michael Saxon, Wenyue Hua, William Yang Wang

2025-04-22

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating
Overthinking in Reasoning Models

Summary

This paper talks about THOUGHTTERMINATOR, a new method that helps AI reasoning models avoid 'overthinking' and use just the right amount of effort to answer both easy and hard questions.

What's the problem?

The problem is that AI models sometimes spend too much time and use too many steps on simple questions, which wastes resources and can even lead to mistakes, while on hard questions, they might not try hard enough and miss the correct answer.

What's the solution?

The researchers introduced THOUGHTTERMINATOR, a special decoding technique that doesn't need extra training. It helps the model decide how many steps or tokens to use depending on how tricky the problem is, making the model better at knowing when to stop thinking and give an answer.

Why it matters?

This matters because it makes AI models faster, more efficient, and more accurate, which is important for using them in real-world situations where both speed and reliability are needed.

Abstract

THOUGHTTERMINATOR, a training-free decoding technique, improves the calibration of reasoning models by optimizing token usage across easy and difficult problems.

View Paper