Running on CPU Upgrade Featured 2.92k The Smol Training Playbook π 2.92k The secrets to building world-class LLMs
Running 3.66k The Ultra-Scale Playbook π 3.66k The ultimate guide to training LLM on large GPU Clusters
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute β’ 10 items β’ Updated Jan 6, 2025 β’ 29
Running Featured 1.27k FineWeb: decanting the web for the finest text data at scale π· 1.27k Generate high-quality text data for LLMs using FineWeb