Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation Paper โข 2605.04128 โข Published 4 days ago โข 10
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper โข 2604.04911 โข Published Apr 6 โข 36
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts Paper โข 2507.20939 โข Published Jul 28, 2025 โข 57
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO Paper โข 2505.13031 โข Published May 19, 2025 โข 4