AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 56
distil-whisper/distil-large-v2 Automatic Speech Recognition • 0.8B • Updated Mar 6, 2025 • 9.16k • 514
distil-whisper/distil-small.en Automatic Speech Recognition • 0.2B • Updated Mar 25, 2024 • 38.5k • 113
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56
nvidia/diar_sortformer_4spk-v1 Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 5.52k • 137
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 48.5k • 36
Open ASR Leaderboard 🏆 Space • Explore speech model benchmarks and request new evaluations • 1.3k