Running on CPU Upgrade • 262 • The Synthetic Data Playbook: Generating Trillions of the Finest Tokens • Visualize synthetic-data experiments as an interactive bookshelf
Running • 46 • Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale • Generate text using extremely small yet powerful language models
Running on CPU Upgrade • Featured • 3.22k • The Smol Training Playbook • The secrets to building world-class LLMs
scottgeng00/olmo-3-preference-mix-deltas_reasoning-chosen_qwen4b-yolo_scottmix-DECON • Viewer • Updated Sep 25, 2025 • 294k • 78 • 1
open-r1/SYNTHETIC-1-SFT-Data-Code_decontaminated • Viewer • Updated Feb 24, 2025 • 49.7k • 12 • 3