Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 3 days ago • 115
DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects Paper • 2606.15133 • Published 13 days ago • 72
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 16 days ago • 201
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers Paper • 2606.13289 • Published 15 days ago • 29
Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack Paper • 2606.14409 • Published 14 days ago • 15
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models Paper • 2606.11289 • Published 17 days ago • 16
InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 15 days ago • 80
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 15 days ago • 140
Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking Paper • 2606.07689 • Published 21 days ago • 5
MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism Paper • 2606.07512 • Published 21 days ago • 39
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 17 days ago • 41
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 18 days ago • 45