Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14, 2025 • 40 • 8.96k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14, 2025 • 272 • 1.62k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14, 2025 • 675 • 2.62k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20, 2025 • 52 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21, 2025 • 5 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24, 2025 • 7 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24, 2025 • 11 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20, 2025 • 52
Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14, 2025 • 40 • 8.96k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14, 2025 • 272 • 1.62k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14, 2025 • 675 • 2.62k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20, 2025 • 52 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21, 2025 • 5 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24, 2025 • 7 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24, 2025 • 11 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20, 2025 • 52