Running 3.92k The Ultra-Scale Playbook π 3.92k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook π 3.22k The secrets to building world-class LLMs
google/gemma-3-4b-it-qat-int4-unquantized Image-Text-to-Text β’ 4B β’ Updated Apr 15, 2025 β’ 55 β’ β’ 10
Running 601 Scaling test-time compute π 601 Boost LLM answers with flexible testβtime search strategies
Running on CPU Upgrade Agents Featured 1.01k Model Memory Utility π 1.01k Calculate GPU memory needed for training Hugging Face models