Active filters: VLM
KnutJaegersberg/Eagle2-2B
Image-Text-to-Text
• 2B • Updated • 3
titanhacker/moondream-2b-Med-Vqa-Finetuned
2B • Updated • 23
• 1
AXERA-TECH/InternVL2_5-1B
Image-Text-to-Text
• Updated • 9
• 1
Efficient-Large-Model/VILA15-3b-hf-preview
Text Generation
• Updated • 221
Efficient-Large-Model/Llama-3-VILA15-8B-hf-preview
Text Generation
• Updated • 4
Efficient-Large-Model/VILA15-13b-hf-preview
Text Generation
• Updated • 4
Efficient-Large-Model/VILA15-40b-hf-preview
Text Generation
• Updated • 4
TIGER-Lab/ABC-Qwen2VL-Instruct
Image-Text-to-Text
• Updated • 5
AXERA-TECH/SmolVLM-256M-Instruct
Updated • 11
• 2
JettZhou/PhysVLM-Qwen2.5-3B
4B • Updated • 2
• 2
di-zhang-fdu/eagle2-9B-forked
Image-Text-to-Text
• 9B • Updated
MLAdaptiveIntelligence/LLaVAction-7B
Video-Text-to-Text
• 8B • Updated • 15
• 1
AXERA-TECH/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
• Updated • 15
• 1
prithivMLmods/Callisto-OCR3-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 281
• 7
4B • Updated • 16
• 4
mradermacher/TongUI-3B-GGUF
3B • Updated • 104
TianheWu/VisualQuality-R1-7B-preview
Reinforcement Learning
• 8B • Updated • 9
• 7
mm-eval/Llama-3-LongVILA-8B-512Frames
Text Generation
• Updated • 4
mradermacher/ImageQuality-R1-v1-GGUF
8B • Updated • 111
mradermacher/ImageQuality-R1-v1-i1-GGUF
8B • Updated • 2.34k
• 1
Image-Text-to-Text
• 0.2B • Updated • 434
• 99
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP
Image-Text-to-Text
• Updated • 21
• 3
nvidia/VILA-HD-8B-PS3-4K-SigLIP
Image-Text-to-Text
• Updated • 35
• 1
One-RL-to-See-Them-All/Orsta-7B
Image-Text-to-Text
• 8B • Updated • 13
• 11
One-RL-to-See-Them-All/Orsta-32B-0321
Image-Text-to-Text
• 33B • Updated • 14
• 1
One-RL-to-See-Them-All/Orsta-32B-0326
Image-Text-to-Text
• 33B • Updated • 11
• 8
TianheWu/VisualQuality-R1-7B
Reinforcement Learning
• 8B • Updated • 1.76k
• 11
mradermacher/Orsta-7B-GGUF
Reinforcement Learning
• 8B • Updated • 92
mradermacher/Orsta-32B-0326-GGUF
33B • Updated • 260
mradermacher/Orsta-32B-0321-GGUF
33B • Updated • 60