OpenThoughts-Agent: Data Recipes for Agentic Models Paper • 2606.24855 • Published 4 days ago • 43
Weight-Space Geometry of Offline Reasoning Training 🧠 Space • Running • Interactive weight-space geometry of six reasoning losses • 27
On Problems of Implicit Context Compression for Software Engineering Agents Paper • 2605.11051 • Published May 11 • 1
SWE-rebench-V2 Collection SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated Mar 3 • 18
Article Mixture of Experts (MoEs) in Transformers ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 169
Article Mixture of Experts Explained osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.15k
Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 630
The Smol Training Playbook 📚 Space • Running on CPU Upgrade • Featured • The secrets to building world-class LLMs • 3.22k
Article Granite 4.0 Nano: Just how small can you go? ibm-granite • Oct 28, 2025 • 125
🦫 PIPer Collection All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3
PIPer: On-Device Environment Setup via Online Reinforcement Learning Paper • 2509.25455 • Published Sep 29, 2025 • 38
Article CircleGuardBench: New Standard for Evaluating AI Moderation Models whitecircle • May 7, 2025 • 59
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 96