Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning Paper • 2606.09290 • Published 2 days ago • 6
SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling Paper • 2606.09304 • Published 2 days ago • 5