Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego, +7 more • Mar 10
The ultimate guide to RL environments: building and scaling them in the LLM era • Building and scaling RL environments for LLM training
The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU • Weyaxi • Jan 2