Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
yjj23
/
minivlm
like
0
PyTorch
English
vlm_model
vision-language-model
multimodal
vision
qwen
siglip
Model card
Files
Files and versions
xet
Community
2
Copy to bucket
new
VLM Model: Qwen2.5 + SigLIP
VLM Model: Qwen2.5 + SigLIP
This model combines:
Vision encoder: google/siglip-base-patch16-224
Language model: Qwen/Qwen2.5-0.5B-Instruct
Downloads last month
4
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support