lamm-mit/Cephalo-Llama-3.2-11B-Vision-Instruct-128k Image-Text-to-Text • 11B • Updated Sep 30, 2024 • 4 • 6
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 291k • 1.58k