Inference Providers
Active filters: VLM
Efficient-Large-Model/NVILA-8B
Text Generation
• Updated • 984
• 7
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation
• Updated • 11
• 2
Efficient-Large-Model/NVILA-Lite-15B-stage2
Text Generation
• Updated • 10
• 1
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 1.17k
• 103
prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 9
• 5
Image-Text-to-Text
• 9B • Updated • 219
• 63
mradermacher/Qwen2-VL-OCR-2B-Instruct-GGUF
2B • Updated • 390
• 2
mradermacher/Qwen2-VL-OCR-2B-Instruct-i1-GGUF
2B • Updated • 554
• 4
KnutJaegersberg/Eagle2-1B
Image-Text-to-Text
• 1B • Updated • 9
• 1
KnutJaegersberg/Eagle2-2B
Image-Text-to-Text
• 2B • Updated • 13
titanhacker/moondream-2b-Med-Vqa-Finetuned
2B • Updated • 3
• 1
AXERA-TECH/InternVL2_5-1B
Image-Text-to-Text
• Updated • 14
• 1
Efficient-Large-Model/VILA15-3b-hf-preview
Text Generation
• Updated • 7
Efficient-Large-Model/Llama-3-VILA15-8B-hf-preview
Text Generation
• Updated • 13
Efficient-Large-Model/VILA15-13b-hf-preview
Text Generation
• Updated • 4
Efficient-Large-Model/VILA15-40b-hf-preview
Text Generation
• Updated • 12
TIGER-Lab/ABC-Qwen2VL-Instruct
Image-Text-to-Text
• Updated • 6
AXERA-TECH/SmolVLM-256M-Instruct
Updated • 7
• 2
JettZhou/PhysVLM-Qwen2.5-3B
4B • Updated • 6
• 2
di-zhang-fdu/eagle2-9B-forked
Image-Text-to-Text
• 9B • Updated MLAdaptiveIntelligence/LLaVAction-7B
Video-Text-to-Text
• 8B • Updated • 7
• 1
AXERA-TECH/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
• Updated • 19
• 1
prithivMLmods/Callisto-OCR3-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 1.13k
• 7
Image-Text-to-Text
• 8B • Updated • 119k
• 40
4B • Updated • 13
• 4
mradermacher/TongUI-3B-GGUF
3B • Updated • 80
TianheWu/VisualQuality-R1-7B-preview
Reinforcement Learning
• 8B • Updated • 106
• 7
mm-eval/Llama-3-LongVILA-8B-512Frames
Text Generation
• Updated mradermacher/ImageQuality-R1-v1-GGUF
8B • Updated • 337