Article: Is it agentic enough? Benchmarking open models on your own tooling • lysandre, SaylorTwift, pcuenq • 12 days ago • 19
Running • The ultimate guide to RL environments: building and scaling them in the LLM era 📝 Building and scaling RL environments for LLM training • 194
Article: Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • qgallouedec • Dec 4, 2025 • 72
Article: DeepSeek-V4: a million-token context that agents can actually use • burtenshaw • Apr 24 • 50
Article: Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 165
Running on CPU Upgrade • Featured • The Smol Training Playbook 📚 The secrets to building world-class LLMs • 3.22k
Article: How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism • ariG23498 • Feb 12 • 20
Article: The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ • huggingface • Feb 3 • 53