Running on Zero Agents 118 Parakeet TDT 0.6b V3 π₯ 118 Transcribe Speech with Multilingual parakeet-tdt-0.6b-v3
Running on CPU Upgrade Agents Featured 1.36k Open ASR Leaderboard π 1.36k Compare speechβtoβtext models across multiple benchmarks
openai/whisper-large-v3-turbo Automatic Speech Recognition β’ 0.8B β’ Updated Oct 4, 2024 β’ 8.38M β’ β’ 3.05k
meta-llama/Llama-3.1-70B-Instruct Text Generation β’ 71B β’ Updated Dec 15, 2024 β’ 707k β’ β’ 921
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification β’ 86.6M β’ Updated Sep 6, 2023 β’ 300k β’ 355
openai/whisper-medium Automatic Speech Recognition β’ 0.8B β’ Updated Feb 29, 2024 β’ 682k β’ 285
facebook/data2vec-audio-large-100h Automatic Speech Recognition β’ 0.3B β’ Updated Jun 27, 2023 β’ 11 β’ 2
Running on Zero Agents Featured 5.07k MusicGen π΅ 5.07k Generate music from a text description and optional melody