Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
blanchefort
's Collections
Medical
VLA models
Audio
Translate
OCR
OmniModels
Edge models
Video encoders
Judge
Datasets for Embodied
Ru text encoders
Text2Image
VLMs
VLMs
updated
Mar 2
Upvote
-
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6, 2025
•
3.45M
•
1.27k
NVEagle/Eagle-X5-13B-Chat
Image-Text-to-Text
•
15B
•
Updated
Sep 16, 2024
•
13
•
28
internlm/internlm-xcomposer2d5-7b
Visual Question Answering
•
Updated
Jul 22, 2024
•
333
•
210
AIRI-Institute/OmniFusion
Updated
Apr 10, 2024
•
59
OpenGVLab/InternVideo2_chat_8B_HD
Video-Text-to-Text
•
8B
•
Updated
Dec 18, 2024
•
154
•
18
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
•
8B
•
Updated
Oct 10, 2024
•
158
•
26
zai-org/cogvlm2-video-llama3-chat
Text Generation
•
13B
•
Updated
Jul 24, 2024
•
2.84k
•
55
nyu-visionx/cambrian-34b
Text Generation
•
35B
•
Updated
Jun 28, 2024
•
37
•
27
zai-org/cogvlm-base-490-hf
Text Generation
•
18B
•
Updated
Nov 20, 2023
•
87
•
7
zai-org/cogvlm-chat-hf
Text Generation
•
18B
•
Updated
Dec 19, 2023
•
979
•
199
zai-org/cogvlm-grounding-generalist-hf
Text Generation
•
18B
•
Updated
Dec 11, 2023
•
165
•
16
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25, 2024
•
102k
•
278
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
184k
•
551
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384
Text Generation
•
6B
•
Updated
Feb 1, 2024
•
27
•
32
LanguageBind/Video-LLaVA-7B-hf
Image-Text-to-Text
•
7B
•
Updated
May 16, 2024
•
14.4k
•
50
openvla/openvla-7b-prismatic
Image-Text-to-Text
•
Updated
Jul 9, 2024
•
84
•
6
openvla/openvla-7b-finetuned-libero-object
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
66.4k
•
1
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
6.22k
•
6
IntelLabs/LlavaOLMoBitnet1B
Updated
Aug 30, 2024
•
173
•
30
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
Oct 1, 2024
•
6.88k
•
381
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e
Text Generation
•
3B
•
Updated
Feb 1, 2024
•
2.97k
•
8
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
7B
•
Updated
Nov 11, 2025
•
143k
•
123
Qwen/Qwen-VL-Chat
Text Generation
•
Updated
Jan 25, 2024
•
79.8k
•
383
LanguageBind/Video-LLaVA-7B
Text Generation
•
7B
•
Updated
Apr 9, 2024
•
2.2k
•
89
LanguageBind/LanguageBind_Image
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
22k
•
11
LanguageBind/LanguageBind_Video
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
3.28k
•
3
llava-hf/llava-1.5-13b-hf
Image-Text-to-Text
•
13B
•
Updated
Jan 27, 2025
•
11.9k
•
34
llava-hf/llava-1.5-7b-hf
Image-Text-to-Text
•
7B
•
Updated
Jun 6, 2025
•
2.84M
•
359
FreedomIntelligence/LongLLaVA-53B-A13B
Image-Text-to-Text
•
52B
•
Updated
Nov 28, 2024
•
22
•
20
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Sep 27, 2024
•
13.3k
•
586
BAAI/Emu3-VisionTokenizer
Feature Extraction
•
0.3B
•
Updated
Oct 8, 2024
•
5.67k
•
63
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
8B
•
Updated
Jun 13, 2025
•
110k
•
1.04k
openbmb/MiniCPM-V
Visual Question Answering
•
3B
•
Updated
Jan 15, 2025
•
1.45k
•
202
openbmb/MiniCPM-V-2
Visual Question Answering
•
3B
•
Updated
Jan 15, 2025
•
19.8k
•
495
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
9B
•
Updated
Jan 15, 2025
•
17.1k
•
1.41k
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
Jan 14, 2025
•
108k
•
775
vikhyatk/moondream2
Image-Text-to-Text
•
2B
•
Updated
Sep 23, 2025
•
2.66M
•
1.41k
allenai/Molmo-72B-0924
Image-Text-to-Text
•
73B
•
Updated
Oct 9, 2025
•
4.32k
•
298
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Apr 24, 2025
•
6.53k
•
157
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Dec 15, 2025
•
39.3k
•
566
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2025
•
1.09k
•
163
deepseek-ai/Janus-1.3B
Any-to-Any
•
2B
•
Updated
Jan 27, 2025
•
4.15k
•
595
neulab/Pangea-7B
8B
•
Updated
Oct 24, 2024
•
261
•
133
neulab/Pangea-7B-hf
8B
•
Updated
Oct 28, 2025
•
411
•
13
BAAI/Aquila-VL-2B-llava-qwen
Visual Question Answering
•
Updated
Nov 25, 2024
•
49
•
62
mistralai/Pixtral-Large-Instruct-2411
Updated
Jul 28, 2025
•
149
•
433
google/paligemma2-10b-pt-224
Image-Text-to-Text
•
10B
•
Updated
Dec 5, 2024
•
660
•
8
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
13.5k
•
168
vidore/colqwen2-v1.0
Visual Document Retrieval
•
Updated
Jun 5, 2025
•
72.5k
•
117
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
Feb 1, 2025
•
32.5k
•
3.6k
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
Feb 1, 2025
•
13.4k
•
476
nvidia/Eagle2-9B
Image-Text-to-Text
•
9B
•
Updated
Jan 28, 2025
•
677
•
63
openbmb/MiniCPM-o-2_6
Any-to-Any
•
9B
•
Updated
Oct 5, 2025
•
283k
•
1.29k
DAMO-NLP-SG/VideoLLaMA3-7B
Video-Text-to-Text
•
8B
•
Updated
Sep 2, 2025
•
58k
•
75
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text
•
2B
•
Updated
Sep 3, 2025
•
4.85k
•
21
AIDC-AI/Ovis2-8B
Image-Text-to-Text
•
9B
•
Updated
Aug 15, 2025
•
28k
•
75
Qwen/Qwen3-VL-2B-Thinking
Image-Text-to-Text
•
2B
•
Updated
Oct 20, 2025
•
98.1k
•
112
LiquidAI/LFM2-VL-3B
Image-Text-to-Text
•
3B
•
Updated
Mar 30
•
10.8k
•
133
facebook/sam3
Mask Generation
•
0.9B
•
Updated
Nov 20, 2025
•
3.03M
•
1.96k
stepfun-ai/Step3-VL-10B-FP8
Image-Text-to-Text
•
Updated
Feb 4
•
337
•
10
nvidia/llama-nemotron-colembed-vl-3b-v2
Visual Document Retrieval
•
4B
•
Updated
Feb 21
•
1.01k
•
21
Upvote
-
Share collection
View history
Collection guide
Browse collections