Key Features

Open-source end-to-end training framework for long-form deep research
Combines supervised fine-tuning with Reinforcement Learning with Evolving Rubrics
Flexible agent stack supports multi-tool searches and asynchronous tool calls
Includes dr-agent-lib for concurrency management and multi-tool integration
Modular design enables deployment with custom tool stacks and domain-specific extensions
High performance on industry benchmarks outperforming larger proprietary models

Starting from a strong base model, DR Tulu undergoes multiple stages of training, which include prompt curation, supervised fine-tuning with teacher-generated trajectories to establish foundational research skills, and reinforcement learning with evolving reward frameworks that focus on improving tool usage, synthesis quality, and citation behavior. The model is designed to integrate with a flexible agent stack that enables it to dynamically choose among various search and browsing tools, enhancing its ability to gather and synthesize information from diverse sources efficiently.


One of DR Tulu's standout features is its modularity and extensibility; it includes an agent library called dr-agent-lib that provides a multi-tool, asynchronous calling framework to manage concurrency and caching effectively. This empowers users to deploy the agent with their own custom tool stacks, achieve reproducibility through accessible training recipes and checkpoints, and extend the model's capabilities by plugging in domain-specific retrieval systems without the need for retraining. DR Tulu’s best-performing 8-billion-parameter model demonstrates notable improvements over larger proprietary systems in rigorous benchmarks, all while maintaining cost-effectiveness and deployment flexibility.

Get more likes & reach the top of search results by adding this button on your site!

Embed button preview - Light theme
Embed button preview - Dark theme
TurboType Banner

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!