UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 7 days ago • 80
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 8 days ago • 86
SenseNova-U1 Collection SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 4 items • Updated about 9 hours ago • 43
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 15 items • Updated 22 days ago • 19
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 15 items • Updated 22 days ago • 19
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer Paper • 2603.19227 • Published Mar 19 • 42
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction Paper • 2603.19231 • Published Mar 19 • 36