Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning Paper • 2606.07602 • Published 12 days ago • 4
SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning Paper • 2510.10047 • Published Oct 11, 2025 • 15
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 107