Towards Unified World Models for Visual Navigation via Memory-Augmented Planning and Foresight
Paper • 2510.08713
Computer Vision and Deep Learning
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer