carlizor's Collections
Image Vision
Updated Aug 12, 2025
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 • Image-Text-to-Text • 5B • Updated Feb 3, 2025 • 202 • 185
AIDC-AI/Ovis1.6-Gemma2-9B • Image-Text-to-Text • Updated Aug 15, 2025 • 75.7k • 273
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14, 2025 • 126k • 775
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 230 • 1.71k
deepseek-ai/Janus-1.3B • Any-to-Any • 2B • Updated Jan 27, 2025 • 4.25k • 596
deepseek-ai/JanusFlow-1.3B • Any-to-Any • 2B • Updated Jan 27, 2025 • 403 • 151
NexaAI/OmniVLM-968M • 0.5B • Updated Aug 20, 2025 • 727 • 532
vikhyatk/moondream2 • Image-Text-to-Text • 2B • Updated Sep 23, 2025 • 2.83M • 1.41k
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 115k • 1.54k
jiuhai/florence-vl-8b-sft • Updated Dec 3, 2024 • 6 • 21
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • 4B • Updated Apr 28, 2025 • 48 • 72
OpenGVLab/InternVL2_5-78B • Image-Text-to-Text • Updated Sep 11, 2025 • 1.15k • 194
Qwen/QVQ-72B-Preview • Image-Text-to-Text • 73B • Updated Jan 12, 2025 • 299 • 610
deepseek-ai/deepseek-vl2 • Image-Text-to-Text • Updated Dec 18, 2024 • 5.72k • 381
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated Dec 15, 2025 • 49.4k • 566
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • 2B • Updated May 2, 2025 • 966 • 103
ByteDance/Sa2VA-1B • Image-Text-to-Text • 1B • Updated Sep 8, 2025 • 981 • 30
HuggingFaceTB/SmolVLM-500M-Instruct • Image-Text-to-Text • 0.5B • Updated Apr 8, 2025 • 187k • 193
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • 73B • Updated Jun 6, 2025 • 185k • 616
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 8.54M • 1.53k
OpenGVLab/InternVideo2_5_Chat_8B • Video-Text-to-Text • 8B • Updated Aug 4, 2025 • 6.1k • 90
nvidia/Eagle2-9B • Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 471 • 63
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • 0.6B • Updated Jan 31, 2025 • 51.3k • 229
allenai/olmOCR-7B-0225-preview • Image-Text-to-Text • 8B • Updated Aug 19, 2025 • 11k • 708
microsoft/Magma-8B • Robotics • 9B • Updated Dec 10, 2025 • 545 • 415
marco/mcdse-2b-v1 • 2B • Updated Oct 29, 2024 • 6.67k • 56
CohereLabs/aya-vision-8b • Image-Text-to-Text • 9B • Updated Jan 9 • 161k • 321
Skywork/Skywork-R1V-38B • Image-Text-to-Text • 38B • Updated Aug 12, 2025 • 112k • 128
docling-project/SmolDocling-256M-preview • Image-Text-to-Text • Updated Sep 17, 2025 • 33.3k • 1.61k
Qwen/Qwen2.5-VL-32B-Instruct • Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 154k • 482
reducto/RolmOCR • Image-Text-to-Text • 8B • Updated Apr 2, 2025 • 316k • 586
moonshotai/Kimi-VL-A3B-Thinking • Image-Text-to-Text • 16B • Updated Jan 30 • 128k • 446
XiaomiMiMo/MiMo-VL-7B-RL • Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 4.29k • 169
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 • Image-Text-to-Text • Updated Dec 4, 2025 • 1.14M • 178
ByteDance/Dolphin • Image-Text-to-Text • Updated Jul 16, 2025 • 332 • 515
nanonets/Nanonets-OCR-s • Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 188k • 1.59k
echo840/MonkeyOCR • Image-Text-to-Text • Updated Mar 3 • 172 • 515
moonshotai/Kimi-VL-A3B-Thinking-2506 • Image-Text-to-Text • 16B • Updated Jan 30 • 10.8k • 358
prithivMLmods/DREX-062225-exp • Image-Text-to-Text • 8B • Updated Jul 20, 2025 • 21 • 6
zai-org/GLM-4.1V-9B-Thinking • Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 426k • 776
HelloKKMe/GTA1-72B • Image-Text-to-Text • 73B • Updated Jul 8, 2025 • 25 • 4
rednote-hilab/dots.ocr • Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 207k • 1.3k