rl-research's Collections
DR Tulu

updated Feb 24

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu

  • DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

    Paper • 2511.19399 • Published Nov 24, 2025 • 63

    Note Our paper!


  • rl-research/DR-Tulu-8B

    Text Generation • 8B • Updated Feb 24 • 2.25k • 73

    Note Final RLER-trained model.


  • rl-research/DR-Tulu-SFT-8B

    Text Generation • 8B • Updated Nov 29, 2025 • 194 • 5

    Note SFT model.


  • rl-research/dr-tulu-sft-data

    Viewer • Updated Nov 25, 2025 • 13.1k • 284 • 29

    Note Data used for SFT training.


  • rl-research/dr-tulu-rl-data

    Viewer • Updated Nov 25, 2025 • 4.88k • 531 • 13

    Note Data used for RL training.


  • rl-research/DR-Tulu-No-RLER-8B

    Text Generation • 8B • Updated Feb 24 • 14

    Note Ablation model, trained with RL without RLER.
