Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation Paper • 2602.05827 • Published 9 days ago • 13
EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration Paper • 2602.10106 • Published 4 days ago • 16
RISE: Self-Improving Robot Policy with Compositional World Model Paper • 2602.11075 • Published 3 days ago • 20
χ₀: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies Paper • 2602.09021 • Published 5 days ago • 20
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 2 days ago • 41
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published Dec 15, 2025 • 105
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27, 2025 • 179
Article Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚 Jul 10, 2024 • 93
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Paper • 2505.17015 • Published May 22, 2025 • 9
MeshPad: Interactive Sketch Conditioned Artistic-designed Mesh Generation and Editing Paper • 2503.01425 • Published Mar 3, 2025 • 14