Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation Paper • 2606.26907 • Published 4 days ago • 41
DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 14 days ago • 112
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 19 days ago • 204
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 18 days ago • 109
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 21 days ago • 32
Native Multimodal Model Zoo Collection Native Multimodal Model Zoo is a curated collection of native multimodal models for seamless understanding and generation across diverse modalities. • 15 items • Updated 23 days ago • 1
Unified Multimodal Model Zoo Collection Unified Multimodal Model Zoo is a curated collection of UMMs that bridge understanding and generation through a shared multimodal architecture. • 59 items • Updated 23 days ago • 1
3D Model Zoo Collection 3D Model Zoo is a curated collection of 3D generative models for capability exploration and systematic evaluation. • 39 items • Updated 23 days ago • 2
Video Model Zoo Collection Video Model Zoo is a curated collection of video generative models for capability exploration and systematic evaluation. • 105 items • Updated 23 days ago • 1
World Model Zoo Collection World Model Zoo is a curated collection of generative world models for interactive exploration and systematic evaluation. • 45 items • Updated 23 days ago • 2
Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria Paper • 2605.08354 • Published May 8 • 23
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published Apr 15 • 127
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published Apr 15 • 167
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 123
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published Apr 2 • 103
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published Mar 30 • 87