reasoning-gym Collection • Datasets generated using https://github.com/open-thought/reasoning-gym (with Qwen3-instruct templates) • 15 items • Updated 12 days ago
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • Featured • 3.15k
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 449