DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
Paper • 2601.22153
Computer Vision and Deep Learning
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation