CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Turing Inc.
company
Verified
AI & ML interests
Autonomous Driving
Recent Activity
View all activity
CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
Heron: Japanese Vision Language Models
One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression. (https://arxiv.org/abs/2501.10064)