Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning Paper • 2605.00347 • Published 5 days ago • 12
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 3 days ago • 131
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 5 days ago • 22
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models Paper • 2605.00414 • Published 5 days ago • 6
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 6 days ago • 6
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published 7 days ago • 32
Step-level Optimization for Efficient Computer-use Agents Paper • 2604.27151 • Published 7 days ago • 16
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 9 days ago • 19
The Last Human-Written Paper: Agent-Native Research Artifacts Paper • 2604.24658 • Published 7 days ago • 17
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 7 days ago • 37
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 6 days ago • 85
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 6 days ago • 202
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published 7 days ago • 49
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models Paper • 2604.26951 • Published 7 days ago • 46
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 7 days ago • 97
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 12 days ago • 119
For-Value: Efficient Forward-Only Data Valuation for finetuning LLMs and VLMs Paper • 2508.10180 • Published 11 days ago • 17