Inference Providers
Active filters: VLM
mradermacher/Orsta-7B-i1-GGUF
Reinforcement Learning
• 8B • Updated • 300
mradermacher/Callisto-OCR3-2B-Instruct-GGUF
2B • Updated • 143
mradermacher/VisualQuality-R1-7B-GGUF
8B • Updated • 955
mradermacher/Callisto-OCR3-2B-Instruct-i1-GGUF
2B • Updated • 167
• 1
sizzlebop/Orsta-32B-0326-Q8_0-GGUF
Image-Text-to-Text
• 33B • Updated • 1
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
• Updated • 1.65M
• 177
hongyuw/bitvla-bitsiglipL-224px-bf16
Image-Text-to-Text
• Updated • 33
• 7
hongyuw/bitvla-siglipL-224px-bf16
Image-Text-to-Text
• Updated • 18
• 4
WallyLovesCats/NVILA-Lite-8B
Text Generation
• Updated • 2
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-mcore
Image-Text-to-Text
• Updated • 2
AXERA-TECH/InternVL2_5-1B-MPO
Image-Text-to-Text
• Updated • 17
DriveFusion/DriveFusion-V0.2
Robotics
• 4B • Updated • 3
Image-Text-to-Text
• Updated • 45
• 1
DriveFusion/DriveFusionQA
Image-Text-to-Text
• 4B • Updated • 11
Image-Text-to-Text
• 8B • Updated • 24
Image-Text-to-Text
• 2B • Updated • 43
Image-Text-to-Text
• 0.6B • Updated • 65
JayRay5/DIVE-Doc-ARD-LRes
JayRay5/DIVE-Doc-ARD-HRes
Text Generation
• 3B • Updated • 16
Image-Text-to-Text
• 8B • Updated • 58.1k
• 29
Image-Text-to-Text
• 33B • Updated • 177
• 27
Efficient-Large-Model/LongVILA-R1-7B
Updated • 249
• 15
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP2
Image-Text-to-Text
• Updated • 14
nvidia/VILA-HD-8B-PS3-4K-SigLIP2
Image-Text-to-Text
• Updated • 26
• 2
nvidia/VILA-HD-8B-PS3-1.5K-C-RADIOv2
Image-Text-to-Text
• Updated • 16
nvidia/VILA-HD-8B-PS3-4K-C-RADIOv2
Image-Text-to-Text
• Updated • 17
AXERA-TECH/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
• Updated • 8
mlx-community/VisualQuality-R1-7B-bf16
Reinforcement Learning
• Updated • 5
mlx-community/VisualQuality-R1-7B-6bit
Reinforcement Learning
• Updated • 4