microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 334k • 1.58k
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit Image-Text-to-Text • 11B • Updated Dec 10, 2024 • 11.8k • 79
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 198k • 476