Inference Providers
Active filters: VLM
mradermacher/Callisto-OCR3-2B-Instruct-i1-GGUF
2B • Updated • 173
• 1
sizzlebop/Orsta-32B-0326-Q8_0-GGUF
Image-Text-to-Text
• 33B • Updated • 1
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
• Updated • 1.33M
• 177
hongyuw/bitvla-bitsiglipL-224px-bf16
Image-Text-to-Text
• Updated • 34
• 7
hongyuw/bitvla-siglipL-224px-bf16
Image-Text-to-Text
• Updated • 18
• 4
WallyLovesCats/NVILA-Lite-8B
Text Generation
• Updated • 2
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-mcore
Image-Text-to-Text
• Updated • 2
AXERA-TECH/InternVL2_5-1B-MPO
Image-Text-to-Text
• Updated • 21
DriveFusion/DriveFusion-V0.2
Robotics
• 4B • Updated • 3
Image-Text-to-Text
• Updated • 50
• 1
DriveFusion/DriveFusionQA
Image-Text-to-Text
• 4B • Updated • 11
Image-Text-to-Text
• 8B • Updated • 24
Image-Text-to-Text
• 2B • Updated • 44
Image-Text-to-Text
• 0.6B • Updated • 77
JayRay5/DIVE-Doc-ARD-LRes
JayRay5/DIVE-Doc-ARD-HRes
Text Generation
• 3B • Updated • 16
Image-Text-to-Text
• 8B • Updated • 246
• 8
Image-Text-to-Text
• 8B • Updated • 57.2k
• 29
Image-Text-to-Text
• 33B • Updated • 176
• 27
Efficient-Large-Model/LongVILA-R1-7B
Updated • 261
• 15
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP2
Image-Text-to-Text
• Updated • 13
nvidia/VILA-HD-8B-PS3-4K-SigLIP2
Image-Text-to-Text
• Updated • 26
• 2
nvidia/VILA-HD-8B-PS3-1.5K-C-RADIOv2
Image-Text-to-Text
• Updated • 15
nvidia/VILA-HD-8B-PS3-4K-C-RADIOv2
Image-Text-to-Text
• Updated • 15
AXERA-TECH/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
• Updated • 8
mlx-community/VisualQuality-R1-7B-bf16
Reinforcement Learning
• Updated • 7
mlx-community/VisualQuality-R1-7B-6bit
Reinforcement Learning
• Updated • 7
mlx-community/VisualQuality-R1-7B-8bit
Reinforcement Learning
• Updated • 7
mlx-community/VisualQuality-R1-7B-4bit
Reinforcement Learning
• Updated • 10
• 1