microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 484k • 1.6k
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.95M • • 3.07k
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit Image-Text-to-Text • 11B • Updated Dec 10, 2024 • 6.17k • 81
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 61.9k • 476