uiuc-kang-lab's Collections

RL Generalizability

updated Nov 20, 2025

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-step-100

    Updated Nov 14, 2025

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-dapo

    2B • Updated Nov 15, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-12-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-10-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-8-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-6-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-4-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-11-6

    2B • Updated Nov 16, 2025 • 2

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-9-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-7-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-5-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-3-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-1-6

    2B • Updated Nov 16, 2025 • 2

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-2-6

    2B • Updated Nov 16, 2025 • 3

  • uiuc-kang-lab/Llama3.2-3B-Instruct-math

    3B • Updated Nov 18, 2025 • 3

  • uiuc-kang-lab/R1-Distill-Qwen-1.5B-mixed

    2B • Updated Nov 20, 2025 • 3