Running 107 The Eiffel Tower Llama 📝 107 Explore the Eiffel Tower Llama experiment with open-source models
Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 92
Running 322 LLM Embeddings Explained: A Visual and Intuitive Guide 🚀 322 How Language Models Turn Text into Meaning, From Traditional
view article Article Training Large Language Models with Interpreter Feedback using WebAssembly Apr 3, 2025 • 14
Running 3.68k The Ultra-Scale Playbook 🌌 3.68k The ultimate guide to training LLM on large GPU Clusters
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 99
Running 592 Scaling test-time compute 📈 592 Run advanced LLM search strategies to boost problem solving