NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers? Paper • 2606.24530 • Published 3 days ago • 53
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 3 days ago • 105
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models Paper • 2509.26628 • Published Sep 30, 2025 • 17
From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models Paper • 2509.25373 • Published Sep 29, 2025
Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation Paper • 2512.19479 • Published Dec 22, 2025 • 1
PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering Paper • 2601.05465 • Published Jan 9
MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation Paper • 2604.14564 • Published Apr 16 • 1
EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions Paper • 2606.23654 • Published 4 days ago • 74
Post-Trained MoE Can Skip Half Experts via Self-Distillation Paper • 2605.18643 • Published May 18 • 30