Guava: An Effective and Universal Harness for Embodied Manipulation Paper • 2606.18363 • Published 9 days ago • 28
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions Paper • 2606.09076 • Published 17 days ago • 61
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 27 days ago • 42
System-2 Mathematical Reasoning via Enriched Instruction Tuning Paper • 2412.16964 • Published Dec 22, 2024 • 2
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents Paper • 2504.15785 • Published Apr 22, 2025 • 22
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published Dec 2, 2025 • 9
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training Paper • 2512.13043 • Published Dec 15, 2025 • 6
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published Mar 25 • 47
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published Mar 25 • 47
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training Paper • 2512.13043 • Published Dec 15, 2025 • 6
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published Dec 2, 2025 • 9
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 247