Running 114 Unlocking On-Policy Distillation for Any Model Family 📝 114 Explore on-policy distillation visualization for any model
Running Featured 88 Distilling 100B+ Models 40x Faster with TRL 📝 88 TRL distillation for 100B+ teachers, 40x faster
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit Image-Text-to-Text • 25B • Updated Jun 23, 2025 • 1.86k • 13
Running on CPU Upgrade 262 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 262 Visualize synthetic‑data experiments as an interactive bookshelf
unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit Image-Text-to-Text • 9B • Updated Oct 31, 2025 • 19.1k • 22
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
intfloat/multilingual-e5-large-instruct Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.53M • • 627
unsloth/Llama-3.2-3B-Instruct-unsloth-bnb-4bit Text Generation • 3B • Updated Jun 2, 2025 • 71.1k • 10