Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VITRA-VLA-3B
like
13
Follow
Microsoft
17.7k
Robotics
Transformers
English
Robotics
Vision-Language-Action
Manipulation
Multimodal
Pretraining
Diffusion
arxiv:
2510.21571
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VITRA-VLA-3B
Commit History
update the tag
4bd47d5
arnoldland
commited on
Dec 9, 2025
Initial commit
643312c
arnoldland
commited on
Dec 9, 2025
initial commit
1d14074
verified
arnoldland
commited on
Dec 9, 2025