-
-
-
-
-
-
Inference Providers
Active filters:
VLM
Image-Text-to-Text
•
9B
•
Updated
•
76
•
62
Image-Text-to-Text
•
2B
•
Updated
•
367
•
32
Image-Text-to-Text
•
1B
•
Updated
•
253
•
26
mradermacher/Qwen2-VL-OCR-2B-Instruct-GGUF
2B
•
Updated
•
103
•
2
mradermacher/Qwen2-VL-OCR-2B-Instruct-i1-GGUF
2B
•
Updated
•
190
•
4
KnutJaegersberg/Eagle2-1B
Image-Text-to-Text
•
1B
•
Updated
•
3
•
1
KnutJaegersberg/Eagle2-2B
Image-Text-to-Text
•
2B
•
Updated
•
1
titanhacker/moondream-2b-Med-Vqa-Finetuned
2B
•
Updated
•
2
•
1
AXERA-TECH/InternVL2_5-1B
Image-Text-to-Text
•
Updated
•
6
•
1
Efficient-Large-Model/VILA15-3b-hf-preview
Text Generation
•
Updated
•
1
Efficient-Large-Model/Llama-3-VILA15-8B-hf-preview
Text Generation
•
Updated
•
1
Efficient-Large-Model/VILA15-13b-hf-preview
Text Generation
•
Updated
•
1
Efficient-Large-Model/VILA15-40b-hf-preview
Text Generation
•
Updated
•
2
TIGER-Lab/ABC-Qwen2VL-Instruct
Image-Text-to-Text
•
Updated
•
31
AXERA-TECH/SmolVLM-256M-Instruct
Updated
•
13
•
2
JettZhou/PhysVLM-Qwen2.5-3B
4B
•
Updated
•
15
•
2
di-zhang-fdu/eagle2-9B-forked
Image-Text-to-Text
•
9B
•
Updated
MLAdaptiveIntelligence/LLaVAction-7B
Video-Text-to-Text
•
8B
•
Updated
•
8
•
1
AXERA-TECH/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
Updated
•
7
•
1
prithivMLmods/Callisto-OCR3-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
•
30
•
6
Image-Text-to-Text
•
8B
•
Updated
•
3.58k
•
33
4B
•
Updated
•
42
•
3
mradermacher/TongUI-3B-GGUF
3B
•
Updated
•
67
TianheWu/VisualQuality-R1-7B-preview
Reinforcement Learning
•
8B
•
Updated
•
3
•
7
mm-eval/Llama-3-LongVILA-8B-512Frames
Text Generation
•
Updated
mradermacher/ImageQuality-R1-v1-GGUF
8B
•
Updated
•
94
mradermacher/ImageQuality-R1-v1-i1-GGUF
8B
•
Updated
•
158
•
1
Image-Text-to-Text
•
0.2B
•
Updated
•
138
•
98
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP
Image-Text-to-Text
•
Updated
•
18
•
3