Active filters: vision
Image-Text-to-Text
• Updated
• 5
usernameoccupied/MaterialSpecVision
Object Detection
• Updated
Image Classification
• 85.8M • Updated
• 6
Image-Text-to-Text
• Updated
RedHatAI/Qwen2-VL-72B-Instruct-quantized.w4a16
Image-Text-to-Text
• 13B • Updated
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned
Image-Text-to-Text
• 11B • Updated
• 1
jfang/mars-vit-base-ctx2m
86M • Updated
cnmoro/mini-image-captioning
Image-to-Text
• 34.2M • Updated
• 149
• 4
cnmoro/tiny-image-captioning
Image-to-Text
• 26.4M • Updated
• 236
• 3
itsanurag/Llama-3.2-90B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
• 91B • Updated
mrcuddle/llama3.2-11B-Vision_instruct-Coder
Image-Text-to-Text
• 11B • Updated
• 1
cnmoro/nano-image-captioning
Image-to-Text
• 10.1M • Updated
• 169
• 3
lens-ai/clip-vit-base-patch32_pcam_finetuned
Feature Extraction
• 87.5M • Updated
• 1
bartowski/MiniCPM-o-2_6-GGUF
Image-Text-to-Text
• 8B • Updated
• 559
• 6
lmstudio-community/MiniCPM-o-2_6-GGUF
Image-Text-to-Text
• 8B • Updated
• 750
• 8
csdl/clipseg-rd64-refined-with-handler
Image Segmentation
• 0.2B • Updated
• 8
mlx-community/Idefics3-8B-Llama3-4bit
Image-Text-to-Text
• 2B • Updated
• 11
mlx-community/Idefics3-8B-Llama3-3bit
Image-Text-to-Text
• 1B • Updated
• 6
mlx-community/Idefics3-8B-Llama3-6bit
Image-Text-to-Text
• 2B • Updated
• 11
mlx-community/Idefics3-8B-Llama3-8bit
Image-Text-to-Text
• 3B • Updated
• 6
lens-ai/adversarial-clip-vit-base-patch32_pcam_finetuned
Feature Extraction
• Updated
• 1
0.4B • Updated
• 6
Object Detection
• 20.2M • Updated
• 90.2k
• 5
Object Detection
• 31.5M • Updated
• 412
• 7
Object Detection
• 76.8M • Updated
• 4.59k
• 13
RedHatAI/Phi-3-vision-128k-instruct-W4A16-G128
Text Generation
• 1B • Updated
• 2
• 1
kairavishal37/LLava-med-api
Image-Text-to-Text
• 8B • Updated
Image Segmentation
• Updated
Aitrepreneur/Florence-2-base
Image-Text-to-Text
• Updated
Aitrepreneur/Florence-2-large
Image-Text-to-Text
• Updated