Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining Paper • 2604.16391 • Published Mar 27 • 4
Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining Paper • 2604.16391 • Published Mar 27 • 4
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model Paper • 2602.10098 • Published Feb 10 • 19
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation Paper • 2602.09849 • Published Feb 10 • 17
Evaluating Gemini Robotics Policies in a Veo World Simulator Paper • 2512.10675 • Published Dec 11, 2025 • 20
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation Paper • 2510.09320 • Published Oct 10, 2025 • 2
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation Paper • 2510.09320 • Published Oct 10, 2025 • 2 • 2
DreamLLM: Synergistic Multimodal Comprehension and Creation Paper • 2309.11499 • Published Sep 20, 2023 • 60
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Paper • 2507.04447 • Published Jul 6, 2025 • 45
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Paper • 2507.04447 • Published Jul 6, 2025 • 45
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Paper • 2502.12894 • Published Feb 18, 2025 • 19