AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 56
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56
nvidia/diar_sortformer_4spk-v1 Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 7.38k • 144
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 5.06k • 40
Running on CPU Upgrade Agents Featured 1.39k Open ASR Leaderboard 🏆 1.39k Explore speech model performance benchmarks