Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VITRA-VLA-3B
like
14
Follow
Microsoft
17.8k
Robotics
Transformers
English
Robotics
Vision-Language-Action
Manipulation
Multimodal
Pretraining
Diffusion
arxiv:
2510.21571
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VITRA-VLA-3B
/
.gitattributes
Commit History
initial commit
1d14074
verified
arnoldland
commited on
Dec 9, 2025