The Hitchhiker's Guide to Agentic AI: From Foundations to Systems Paper • 2606.24937 • Published 5 days ago • 14
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 4 days ago • 81
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 4 days ago • 126
GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning Paper • 2606.17480 • Published 11 days ago • 3
PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models Paper • 2606.19534 • Published 10 days ago • 62
MaineCoon: Pursuing A Real-Time Audio-Visual Social World Model Paper • 2606.17800 • Published 11 days ago • 13
DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis Paper • 2604.13416 • Published 9 days ago • 32
HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining Paper • 2606.20521 • Published 9 days ago • 13
DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects Paper • 2606.15133 • Published 14 days ago • 73
JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoising Paper • 2606.20563 • Published 9 days ago • 20
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 9 days ago • 39
Adaptive Volumetric Mechanical Property Fields Invariant to Resolution Paper • 2606.18231 • Published 11 days ago • 5
Holo-World: Unified Camera, Object and Weather Control for Video World Model Paper • 2606.20083 • Published 9 days ago • 9
ENPIRE: Agentic Robot Policy Self-Improvement in the Real World Paper • 2606.19980 • Published 9 days ago • 14
From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning Paper • 2606.17682 • Published 11 days ago • 26