Running Featured 88 Distilling 100B+ Models 40x Faster with TRL 📝 88 TRL distillation for 100B+ teachers, 40x faster
Running Agents 432 Reward Bench Leaderboard 📐 432 Explore and compare model scores on RewardBench benchmarks
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 14 days ago • 170