view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Nov 19, 2025 • 34
Running 77 Unlocking On-Policy Distillation for Any Model Family 📝 77 Improve model performance by transferring knowledge between different model families
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16, 2025 • 34