SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting • Paper • 2604.10688 • Published 5 days ago • 19
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning • Paper • 2505.15400 • Published May 21, 2025 • 23