Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tencent
/
Youtu-VL-4B-Instruct
like
154
Follow
Tencent
9.81k
Image-Text-to-Text
Transformers
Safetensors
youtu_vl
text-generation
conversational
custom_code
arxiv:
2601.19798
arxiv:
2512.24618
License:
youtu-vl
Model card
Files
Files and versions
xet
Community
11
Deploy
Use this model
refs/pr/5
Youtu-VL-4B-Instruct
/
assets
5.23 MB
Ctrl+K
Ctrl+K
6 contributors
History:
1 commit
Yinsongliu
Upload model with LFS assets
2951b22
3 months ago
architecture.png
Safe
1.07 MB
xet
Upload model with LFS assets
3 months ago
general-multimodal-performance.png
Safe
408 kB
xet
Upload model with LFS assets
3 months ago
logo.png
614 kB
xet
Upload model with LFS assets
3 months ago
vision-centric-performance.png
Safe
534 kB
xet
Upload model with LFS assets
3 months ago
youtu-vl-logo.png
Safe
95.4 kB
xet
Upload model with LFS assets
3 months ago
youtu-vl-overview.png
Safe
2.51 MB
xet
Upload model with LFS assets
3 months ago