ShengWen1998's Collections
Multimodal Models (updated Mar 3)
google/gemma-3-12b-it • Image-Text-to-Text • Updated Mar 21, 2025 • 2.55M • 704
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • Updated Dec 2, 2024 • 158k • 302
HuggingFaceTB/SmolVLM2-2.2B-Instruct • Image-Text-to-Text • Updated Apr 8, 2025 • 98.8k • 313
microsoft/Phi-3.5-vision-instruct • Image-Text-to-Text • Updated Dec 10, 2025 • 1.49M • 730
moondream/moondream-2b-2025-04-14 • Image-Text-to-Text • 2B • Updated May 21, 2025 • 294 • 9