microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 512k • 1.61k
llava-hf/llava-onevision-qwen2-72b-ov-hf • Image-Text-to-Text • 73B • Updated Jun 18, 2025 • 1.81k • 10
facebook/metaclip-h14-fullcc2.5b • Zero-Shot Image Classification • 1.0B • Updated Jan 11, 2024 • 9.35k • 48