AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 55
distil-whisper/distil-small.en Automatic Speech Recognition • 0.2B • Updated Mar 25, 2024 • 10.2k • 112
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56
nvidia/diar_sortformer_4spk-v1 Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 3.71k • 131
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 784 • 30
Open ASR Leaderboard 🏆 Space • Explore speech model benchmarks and submit evaluation requests • 1.22k