xiaomi-research/OneVL_visual_decoder_pt
Image-Text-to-Text
Safetensors
English
qwen3_vl
autonomous-driving
vision-language-action
chain-of-thought
trajectory-prediction
VLA
conversational
arxiv: 2604.18486
License: apache-2.0
refs/pr/1
OneVL_visual_decoder_pt/processor_config.json
Commit History
Upload folder using huggingface_hub
9bd9fd6
verified
JinghuiLuAstronaut committed 6 days ago