albertmundu's Collections
Vision-Language Collections
Updated Sep 26, 2023
Some popular models for the image-text domain.
Salesforce/blip-image-captioning-large • Image-to-Text • 0.5B • Updated Feb 3, 2025 • 1.4M • 1.47k
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3, 2025 • 2.15M • 849
Salesforce/instructblip-vicuna-7b • Image-Text-to-Text • 8B • Updated Feb 3, 2025 • 11.3k • 99
microsoft/git-large-coco • Image-to-Text • 0.4B • Updated Jun 26, 2023 • 5.03k • 105
Salesforce/blip2-opt-2.7b • Image-Text-to-Text • 4B • Updated Feb 3, 2025 • 428k • 438
Salesforce/blip2-flan-t5-xxl • Image-Text-to-Text • 12B • Updated Feb 3, 2025 • 1.28k • 94
Salesforce/instructblip-flan-t5-xxl • Image-Text-to-Text • 12B • Updated Feb 3, 2025 • 635 • 21
Salesforce/instructblip-flan-t5-xl • Image-Text-to-Text • Updated Feb 3, 2025 • 10.5k • 30
microsoft/trocr-base-handwritten • Image-to-Text • 0.3B • Updated Feb 11, 2025 • 151k • 490
microsoft/trocr-base-printed • Image-to-Text • 0.3B • Updated May 27, 2024 • 780k • 205
microsoft/trocr-large-printed • Image-to-Text • 0.6B • Updated May 27, 2024 • 311k • 179
microsoft/trocr-small-printed • Image-to-Text • 61.4M • Updated May 27, 2024 • 26.9k • 48
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 254k • 158
google/pix2struct-large • Image-to-Text • 1B • Updated Sep 6, 2023 • 1.18k • 34