The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.58k
Reward Models 10-2025 • Collection • A collection of great reward models for research and production • 7 items • Updated 8 days ago • 9
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design • Paper • 2506.04734 • Published Jun 5 • 20