Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
Jellybox
JoyFusion
LocalAI
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Pi
Inference Providers
Select all
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
Merge
custom_code
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results
Apply filters
Models
3,692
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
mlx-community/UI-TARS-72B-DPO-8bit
Image-Text-to-Text
•
Updated
Mar 6, 2025
•
8
mlx-community/UI-TARS-72B-DPO-bf16
Image-Text-to-Text
•
Updated
Mar 6, 2025
•
5
•
1
unsloth/Qwen2-VL-72B-bnb-4bit
Image-Text-to-Text
•
76B
•
Updated
Mar 9, 2025
•
4
unsloth/Qwen2-VL-7B
Image-Text-to-Text
•
8B
•
Updated
Mar 9, 2025
•
8
unsloth/Qwen2-VL-7B-bnb-4bit
Image-Text-to-Text
•
9B
•
Updated
Mar 9, 2025
•
81
sbintuitions/sarashina2-vision-8b
Image-to-Text
•
8B
•
Updated
Mar 27, 2025
•
148
•
11
sbintuitions/sarashina2-vision-14b
Image-to-Text
•
14B
•
Updated
Mar 27, 2025
•
87
•
11
timtkddn/ko-ocr-qwen2-vl-awq
Image-Text-to-Text
•
73B
•
Updated
Apr 2, 2025
asuglia/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Mar 10, 2025
•
106
mertaylin/Qwen2-VL-7B
Image-Text-to-Text
•
8B
•
Updated
Mar 12, 2025
•
1
PKU-Alignment/s1-m_7b_beta
Image-Text-to-Text
•
Updated
Mar 13, 2025
•
5
oieieio/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Mar 14, 2025
•
2
FriendliAI/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Mar 17, 2025
•
1
•
1
FriendliAI/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Mar 17, 2025
•
4
FriendliAI/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Mar 17, 2025
•
26
•
1
FriendliAI/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
8B
•
Updated
Mar 18, 2025
•
2
•
1
sagaxlearn/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Mar 18, 2025
samgreen/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text
•
73B
•
Updated
Apr 2, 2025
•
307
•
1
Doctor-James/OmniMamba
Any-to-Any
•
Updated
Mar 20, 2025
Efficient-Large-Model/NVILA-Lite-2B-Verifier
Updated
Mar 21, 2025
•
27
•
7
clecho52/Qwen2-VL-7B-Instruct-Q2_K-GGUF
Image-Text-to-Text
•
8B
•
Updated
Mar 19, 2025
•
1
hfl/Qwen2.5-VL-7B-Instruct-GPTQ-Int3
Image-Text-to-Text
•
8B
•
Updated
Mar 20, 2025
•
4
•
1
hfl/Qwen2.5-VL-3B-Instruct-GPTQ-Int3
Image-Text-to-Text
•
4B
•
Updated
Mar 20, 2025
•
2
•
1
cdreetz/audio-llama-hf
Text Generation
•
Updated
Mar 20, 2025
iMeanAI/Qwen2-VL-TokenSelection-2B
Image-Text-to-Text
•
2B
•
Updated
May 5, 2025
•
6
puar-playground/Phi-3-MusiX
Image-Text-to-Text
•
Updated
Aug 14, 2025
•
23
samgreen/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
Apr 2, 2025
•
169
•
7
turing-motors/Heron-NVILA-Lite-2B
Image-Text-to-Text
•
Updated
Oct 20, 2025
•
81
•
7
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text
•
Updated
Apr 14, 2025
•
223k
•
476
Ertugrul/Qwen2.5-VL-7B-Captioner-Relaxed
Image-Text-to-Text
•
8B
•
Updated
Mar 22, 2025
•
1.19k
•
29
Previous
1
...
23
24
25
26
27
...
100
Next