The ultimate guide to RL environments: building and scaling them in the LLM era • Building and scaling RL environments for LLM training • 44
Defeating the trainer-generator precision mismatch in TRL • 16
Distilling 100B+ Models 40x Faster with TRL • TRL distillation for 100B+ teachers, 40x faster • 77
google/timesfm-2.5-200m-transformers Time Series Forecasting • 0.2B • Updated 25 days ago • 188k • 79