Running Featured 1.34k FineWeb: decanting the web for the finest text data at scale π· 1.34k Explore and download the FineWeb webβtext dataset
Running 3.84k The Ultra-Scale Playbook π 3.84k The ultimate guide to training LLM on large GPU Clusters
view article Article The Optimal Architecture for Small Language Models codelion β’ Dec 26, 2025 β’ 121
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook π 3.18k The secrets to building world-class LLMs
sanjay-saatyaki/details_sanjay-saatyaki__smol-train_private Viewer β’ Updated Sep 16, 2025 β’ 2.64k β’ 1
sanjay-saatyaki/details_sanjay-saatyaki__smol-train_private Viewer β’ Updated Sep 16, 2025 β’ 2.64k β’ 1