view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 8 days ago • 52
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • 28 days ago • 46
Running 166 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 166 Building and scaling RL environments for LLM training
DFlash Collection Block Diffusion for Flash Speculative Decoding • 21 items • Updated 12 days ago • 117
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188
Running 115 The Eiffel Tower Llama 📝 115 Explore the Eiffel Tower Llama experiment with open-source models
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98
Running 343 LLM Embeddings Explained: A Visual and Intuitive Guide 🚀 343 How Language Models Turn Text into Meaning, From Traditional
view article Article Gotchas in Tokenizer Behavior Every Developer Should Know qgallouedec • Apr 18, 2025 • 72
view article Article Training Large Language Models with Interpreter Feedback using WebAssembly axolotl-ai-co • Apr 3, 2025 • 14
Running 3.85k The Ultra-Scale Playbook 🌌 3.85k The ultimate guide to training LLM on large GPU Clusters
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 101