Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models Paper • 2605.07721 • Published May 8 • 29
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
Running Agents 50 Leaderboard: Physical Reasoning from Video 🏃 50 Submit model evaluations and view leaderboard results