xiaomi-research/OneVL_visual_decoder_pt_ar1
Xiaomi Research
Image-Text-to-Text
Safetensors
English
qwen3_vl
autonomous-driving
vision-language-action
chain-of-thought
trajectory-prediction
VLA
conversational
arxiv: 2604.18486
License: apache-2.0
Update model card metadata and usage information
#1 opened about 10 hours ago by nielsr