DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Paper
• 2511.19399 • Published
Models and data associated with DR Tulu: http://allenai-web/papers/drtulu
Note Our paper!
Note Final RLER-trained model.
Note SFT model.
Note Data used for SFT training.
Note Data used for RL training.
Note Ablation model, trained with RL without RLER.