VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 12 days ago • 119
Article The Open Source Community is backing OpenEnv for Agentic RL • burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua, +17 • 19 days ago • 95
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 26 days ago • 235
DR Tulu Collection • Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 6 items • Updated Feb 24 • 37