Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VITRA-VLA-3B
like
13
Follow
Microsoft
17.7k
Robotics
Transformers
English
Robotics
Vision-Language-Action
Manipulation
Multimodal
Pretraining
Diffusion
arxiv:
2510.21571
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VITRA-VLA-3B
/
config.json
Commit History
Initial commit
643312c
arnoldland
commited on
Dec 9, 2025