view article Article Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system CarlCochet, ThomasSimonini • Feb 7, 2023 • 4
view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +6 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • May 27 • 42
view article Article The Open Source Community is backing OpenEnv for Agentic RL +17 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua • 23 days ago • 102