view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +6 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • 29 days ago • 42
Running 6 Token-In, Token-Out Done Right 🧩 6 Explore an interactive simulation while reading the article
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • May 14 • 61
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • May 11 • 24
Running 192 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 192 Building and scaling RL environments for LLM training