Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 11 days ago • 87
Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms Paper • 2604.23775 • Published 15 days ago • 44
AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration Paper • 2604.01014 • Published Apr 1 • 11
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation Paper • 2510.07313 • Published Oct 8, 2025 • 7
Query-Kontext: An Unified Multimodal Model for Image Generation and Editing Paper • 2509.26641 • Published Sep 30, 2025 • 4
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 195
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing Paper • 2603.19228 • Published Mar 19 • 68
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 186
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published Mar 4 • 89
Geometry-Aware Rotary Position Embedding for Consistent Video World Model Paper • 2602.07854 • Published Feb 8 • 10