Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning Paper • 2606.31825 • Published 4 days ago • 15
TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration Paper • 2606.04743 • Published Jun 3 • 47
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published Jun 24, 2025 • 45