view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 15 days ago • 74
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 Text Generation • 561B • Updated 15 days ago • 128k • • 245
view article Article Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining nvidia • 20 days ago • 17
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • about 1 month ago • 120
Running on CPU Upgrade Featured 403 ML Intern 🤖 403 Explore machine learning tasks via an interactive web app
The ATOM Report: Measuring the Open Language Model Ecosystem Paper • 2604.07190 • Published Apr 8 • 5
Running Featured 88 Distilling 100B+ Models 40x Faster with TRL 📝 88 TRL distillation for 100B+ teachers, 40x faster
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 164