VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection Paper • 2603.00912 • Published Mar 1 • 40
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 8 items • Updated 30 days ago • 109