Running Featured 90 Distilling 100B+ Models 40x Faster with TRL 📝 90 TRL distillation for 100B+ teachers, 40x faster
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated Nov 21, 2025 • 3.23k • 243