Running on CPU Upgrade Agents Featured 1.39k Open ASR Leaderboard 🏆 1.39k Compare speech-to-text models by WER and speed
Running on Zero Agents Featured 688 Di♪♪Rhythm 🎶 688 Blazingly Fast and Embarrassingly Simple Song Generation
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 509k • 1.61k