The Smol Training Playbook 📚 The secrets to building world-class LLMs • 2.95k
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2, 2025 • 61
chriswhpang/Llama-3.2-1B-Instruct-OpenThought-SFT-GRPO-16bit Text Generation • 1B • Updated Feb 19, 2025 • 4
chriswhpang/Llama-3.2-1B-Instruct-OpenThought-SFT-VLLM Text Generation • 1B • Updated Feb 17, 2025 • 3
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 243