Vision
updated
liuhaotian/llava-v1.6-34b
Image-Text-to-Text
• 35B • Updated • 30.3k
• 362
deepseek-ai/deepseek-vl-7b-base
7B • Updated • 76
• 64
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
• 7B • Updated • 7.7k
• 270
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
• 8B • Updated • 155k
• 620
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
• 8B • Updated • 73
• 95
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
• 8B • Updated • 1.63k
• 28
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 699
• 123
microsoft/Phi-3-vision-128k-instruct
Text Generation
• Updated • 93k
• 970
Image-Text-to-Text
• 7B • Updated • 85.3k
• 198
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
• Updated • 1.23M
• 728
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
• 11B • Updated • 13.2k
• 585
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
• 11B • Updated • 290k
• 1.58k
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.67k
• 133
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
• 89B • Updated • 2.02k
• 354
meta-llama/Llama-Guard-3-11B-Vision
Image-Text-to-Text
• 11B • Updated • 2.41k
• 69
Image-Text-to-Text
• 73B • Updated • 4.23k
• 298
Image-Text-to-Text
• 8B • Updated • 27.3k
• 565
Image-Text-to-Text
• 8B • Updated • 2.55k
• 163
Image-Text-to-Text
• Updated • 1.21k
• 157
Text-to-Video
• Updated • 10k
• • 1.32k
Image-Text-to-Text
• Updated • 348
• 1.71k
Image-to-Video
• Updated • 401k
• • 2.13k