Starting from a strong base model, DR Tulu is trained in multiple stages: prompt curation, supervised fine-tuning on teacher-generated trajectories to establish foundational research skills, and reinforcement learning with evolving rewards that target tool usage, synthesis quality, and citation behavior. The model is designed to integrate with a flexible agent stack, dynamically choosing among search and browsing tools at each step so it can gather and synthesize information from diverse sources efficiently.
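To make the tool-selection loop concrete, here is a minimal sketch of an agent step that dispatches a model-chosen tool call. Everything here is an illustrative assumption: the tool names (web_search, browse_page), the action format, and the placeholder tool bodies are hypothetical and do not reflect DR Tulu's actual interface.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable


@dataclass
class Tool:
    """A named tool the policy model can choose to invoke."""
    name: str
    description: str
    run: Callable[[str], Awaitable[str]]


async def web_search(query: str) -> str:
    # Placeholder: a real implementation would call a search API.
    return f"[search results for: {query}]"


async def browse_page(url: str) -> str:
    # Placeholder: a real implementation would fetch and extract page text.
    return f"[page contents of: {url}]"


TOOLS = {
    "web_search": Tool("web_search", "Search the web for a query.", web_search),
    "browse_page": Tool("browse_page", "Fetch and read a specific URL.", browse_page),
}


async def research_step(model_action: dict) -> str:
    """Dispatch one tool call chosen by the model.

    `model_action` is assumed to be parsed model output of the form
    {"tool": <name>, "argument": <string>}; at each step of the research
    loop the policy decides which tool to invoke and with what argument.
    """
    tool = TOOLS[model_action["tool"]]
    return await tool.run(model_action["argument"])


async def main() -> None:
    # Simulated two-step trajectory: search first, then browse a result.
    print(await research_step({"tool": "web_search", "argument": "DR Tulu training recipe"}))
    print(await research_step({"tool": "browse_page", "argument": "https://example.org/report"}))


if __name__ == "__main__":
    asyncio.run(main())
```

Because tools are registered behind a uniform async interface, swapping in a different search backend or adding a domain-specific retriever only changes the registry, not the agent loop.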
One of DR Tulu's standout features is its modularity and extensibility: it ships with an agent library, dr-agent-lib, that provides a multi-tool asynchronous calling framework with built-in concurrency management and caching. Users can deploy the agent with their own custom tool stacks, reproduce results from the released training recipes and checkpoints, and extend the model by plugging in domain-specific retrieval systems without retraining. DR Tulu's best-performing 8-billion-parameter model demonstrates notable improvements over larger proprietary systems on rigorous benchmarks, all while remaining cost-effective and flexible to deploy.
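The concurrency-and-caching pattern the library is described as providing can be illustrated with a small self-contained sketch. This is not dr-agent-lib's real API; CachedToolRunner, its method names, and the default limits are assumptions made for illustration.

```python
import asyncio
import hashlib
from typing import Awaitable, Callable


class CachedToolRunner:
    """Hypothetical sketch of bounded-concurrency tool calling with a cache.

    Repeated calls with the same (tool, argument) pair are served once:
    a per-key lock deduplicates identical in-flight requests, and a
    semaphore caps how many tool calls run concurrently.
    """

    def __init__(self, max_concurrency: int = 8):
        self._semaphore = asyncio.Semaphore(max_concurrency)
        self._cache: dict[str, str] = {}
        self._locks: dict[str, asyncio.Lock] = {}

    @staticmethod
    def _key(tool_name: str, argument: str) -> str:
        return hashlib.sha256(f"{tool_name}\x00{argument}".encode()).hexdigest()

    async def call(
        self,
        tool_name: str,
        argument: str,
        fn: Callable[[str], Awaitable[str]],
    ) -> str:
        key = self._key(tool_name, argument)
        lock = self._locks.setdefault(key, asyncio.Lock())
        async with lock:  # duplicate requests wait for the first to finish
            if key in self._cache:
                return self._cache[key]
            async with self._semaphore:  # bound concurrent tool traffic
                result = await fn(argument)
            self._cache[key] = result
            return result


async def fake_search(query: str) -> str:
    await asyncio.sleep(0.1)  # stand-in for network latency
    return f"results for {query!r}"


async def main() -> None:
    runner = CachedToolRunner(max_concurrency=4)
    # Issue duplicate queries concurrently; each unique query hits the
    # underlying tool exactly once, the rest are served from the cache.
    results = await asyncio.gather(
        *(runner.call("search", q, fake_search) for q in ["a", "b", "a", "b"])
    )
    print(results)


if __name__ == "__main__":
    asyncio.run(main())
```

Keying the cache on the tool name plus its argument is what lets a custom tool stack reuse the same runner unchanged: a new retriever only needs to expose an async callable.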

