view article Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 7 days ago • 60
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 14 days ago • 46
view article Article The Open Source Community is backing OpenEnv for Agentic RL +17 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua • 17 days ago • 91
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 27 days ago • 127
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • May 14 • 61
Sleeping RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50
view article Article Announcing the Hugging Face Fellowship Program merve, espejelomar • May 17, 2022 • 16
Sleeping RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
Sleeping RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation