DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu

2025-04-16

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and
Verifiable Mathematical Dataset for Advancing Reasoning

Summary

This paper talks about DeepMath-103K, which is a huge collection of tough math problems designed to help AI models get better at solving and reasoning through complex math.

What's the problem?

The problem is that most existing math datasets for training AI are either too simple, have repeated or copied questions, or aren't good for testing if a model can really reason through hard math problems. This makes it difficult to push AI to become truly strong at advanced math reasoning.

What's the solution?

The researchers created DeepMath-103K, a new dataset with over 100,000 challenging and original math problems. They made sure the problems are not copied from other sources and are tough enough to really test and improve how well AI models can learn to reason using reinforcement learning techniques.

Why it matters?

This matters because having a better dataset helps scientists build AI that can actually understand and solve advanced math, not just memorize answers. This could lead to smarter AI that can help with real-world math problems in science, engineering, and education.

Abstract

A large-scale dataset, DeepMath-103K, comprising complex mathematical problems is introduced to enhance reinforcement learning-driven reasoning in models.

View Paper