VaseVQA: Multimodal Agent and Benchmark for Ancient Greek Pottery
Paper • 2509.17191 • Published • 1
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion