Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
knoveleng 's Collections
polyglot-lion-v1.5
polyglot-lion
Multilingual Datasets for Singapore
Mathematics Benchmark Datasets
Open-RS

Open-RS

updated Mar 21, 2025

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"

Upvote
13

  • Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

    Paper • 2503.16219 • Published Mar 20, 2025 • 52

  • knoveleng/OpenRS-GRPO

    Text Generation • 2B • Updated Mar 18 • 14 • 5

  • knoveleng/Open-RS1

    Text Generation • 2B • Updated Mar 18 • 120 • 4

  • knoveleng/Open-RS2

    Text Generation • 2B • Updated Mar 18 • 108 • 1

  • knoveleng/Open-RS3

    Text Generation • 2B • Updated Mar 18 • 296 • • 21

  • knoveleng/open-rs

    Viewer • Updated Mar 18 • 7k • 1.37k • 11

  • knoveleng/open-s1

    Viewer • Updated Mar 18 • 18.6k • 327 • 4

  • knoveleng/open-deepscaler

    Viewer • Updated Mar 18 • 21k • 1.15k • 4
Upvote
13
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs