Running RL Featured 73 Survival Island Game 🏝 73 Watch an AI agent survive on a procedurally generated island
Running Featured 78 Distilling 100B+ Models 40x Faster with TRL 📝 78 TRL distillation for 100B+ teachers, 40x faster
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 18 days ago • 176
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated 24 days ago • 58
Running on CPU Upgrade 232 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 232 Explore synthetic data experiments on a virtual bookshelf
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 267k • 2.83k
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 18 days ago • 152