tencent/HY-Embodied-0.5
Image-Text-to-Text
Transformers
Safetensors
multilingual
hunyuan_vl_mot
hunyuan
vision-language
Embodied
image-to-text
2B
end-to-end
MoT
conversational
custom_code
arxiv: 2604.07430
License: other
main
HY-Embodied-0.5/video_preprocessor_config.json
Commit History
init HY-Embodied-0.5
1400ae4
castleyu committed about 23 hours ago