Vision
updated
liuhaotian/llava-v1.6-34b
Image-Text-to-Text
• 35B • Updated • 30.3k
• 362
deepseek-ai/deepseek-vl-7b-base
7B • Updated • 76
• 64
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
• 7B • Updated • 7.81k
• 270
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
• 8B • Updated • 162k
• 620
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
• 8B • Updated • 71
• 95
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
• 8B • Updated • 1.66k
• 28
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 713
• 123
microsoft/Phi-3-vision-128k-instruct
Text Generation
• Updated • 95.9k
• 970
Image-Text-to-Text
• 7B • Updated • 87.7k
• 198
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
• Updated • 1.27M
• 728
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
• 11B • Updated • 13.3k
• 585
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
• 11B • Updated • 293k
• 1.58k
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.67k
• 133
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
• 89B • Updated • 2k
• 354
meta-llama/Llama-Guard-3-11B-Vision
Image-Text-to-Text
• 11B • Updated • 2.46k
• 70
Image-Text-to-Text
• 73B • Updated • 4.22k
• 298
Image-Text-to-Text
• 8B • Updated • 27.2k
• 565
Image-Text-to-Text
• 8B • Updated • 2.58k
• 163
Image-Text-to-Text
• Updated • 1.25k
• 157
Text-to-Video
• Updated • 9.49k
• • 1.32k
Image-Text-to-Text
• Updated • 349
• 1.71k
Image-to-Video
• Updated • 395k
• • 2.13k