Running Featured 41 Porting nanochat to Transformers: an AI modeling history lesson 📝 41 Learn about ML and Transformers through nanochat
Running on CPU Upgrade Featured 2.58k The Smol Training Playbook 📚 2.58k The secrets to building world-class LLMs
view article Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning +2 Oct 27 • 73
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 176
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26 • 70
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 189