TingChen-ppmc/Zhengzhou_Dialect_Conversational_Speech_Corpus Viewer • Updated Dec 20, 2023 • 2.01k • 82 • 3
TingChen-ppmc/Nanchang_Dialect_Conversational_Speech_Corpus Viewer • Updated Dec 20, 2023 • 1.67k • 29 • 1
TingChen-ppmc/Shanghai_Dialect_Conversational_Speech_Corpus Viewer • Updated May 31, 2024 • 3.79k • 106 • 9
TingChen-ppmc/Tianjin_Dialect_Conversational_Speech_Corpus Viewer • Updated May 31, 2024 • 5.17k • 30
TingChen-ppmc/Changsha_Dialect_Conversational_Speech_Corpus Viewer • Updated Dec 20, 2023 • 1.49k • 52 • 2
Running on Zero Agents 166 StyleTTS2: Ukrainian text to speech 🔈 166 StyleTTS2 trained on Ukrainian multispeaker dataset
BELLE-2/Belle-whisper-large-v3-turbo-zh Automatic Speech Recognition • 0.8B • Updated Dec 16, 2024 • 375 • 77
ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16, 2025 • 737 • 88
kotoba-tech/kotoba-whisper-v2.2 Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 220k • 108
MERaLiON/MERaLiON-AudioLLM-Whisper-SEA-LION Automatic Speech Recognition • 10B • Updated Feb 2 • 59 • 29
BELLE-2/Belle-whisper-large-v3-zh-punct Automatic Speech Recognition • 2B • Updated Apr 16, 2025 • 170 • 50
MohamedRashad/Arabic-Whisper-CodeSwitching-Edition Automatic Speech Recognition • 2B • Updated Jul 7, 2024 • 1.02k • 32
ghost613/whisper-large-v3-turbo-korean Automatic Speech Recognition • 0.8B • Updated Oct 25, 2024 • 853 • 15
mjwong/whisper-large-v3-turbo-singlish Automatic Speech Recognition • 0.8B • Updated May 3, 2025 • 71 • 2
facebook/wav2vec2-xlsr-53-espeak-cv-ft Automatic Speech Recognition • Updated Dec 10, 2021 • 329k • 49
padmalcom/wav2vec2-large-nonverbalvocalization-classification Audio Classification • Updated Jan 11, 2023 • 1.8k • 10
biodatlab/whisper-th-medium-combined Automatic Speech Recognition • 0.8B • Updated Feb 20, 2024 • 3.36k • 20
AudioLLMs/Multitask-National-Speech-Corpus-v1-extend Viewer • Updated Mar 31, 2025 • 15.2M • 12.8k • 5
nvidia/diar_sortformer_4spk-v1 Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 7.13k • 142
lelegu/omni-router-speechcrawl-streaming-asr-0.6b-v1 Automatic Speech Recognition • Updated Oct 15, 2025 • 1
TalTechNLP/whisper-large-v3-turbo-et-verbatim Automatic Speech Recognition • 0.9B • Updated Apr 29 • 138 • 3
nvidia/multitalker-parakeet-streaming-0.6b-v1 Automatic Speech Recognition • Updated Jan 28 • 595 • 112
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated Mar 11 • 1.81M • 895
UsefulSensors/moonshine-streaming-tiny Automatic Speech Recognition • 44.1M • Updated Feb 10 • 7.32k • 12
formospeech/whisper-large-v2-taiwanese-hakka-v1 Automatic Speech Recognition • 2B • Updated May 12 • 2
CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • 2B • Updated 16 days ago • 743k • 1.02k
tiantiaf/voxlect-english-dialect-whisper-large-v3 Audio Classification • 2B • Updated Aug 10, 2025 • 88 • 2
ibm-granite/granite-speech-4.1-2b Automatic Speech Recognition • 2B • Updated 14 days ago • 412k • 145
ibm-granite/granite-speech-4.1-2b-plus Automatic Speech Recognition • 2B • Updated 10 days ago • 18.3k • 82
videosdk-live/Namo-Turn-Detector-v1-Multilingual Voice Activity Detection • Updated Oct 15, 2025 • 7.97k • 21
nvidia/nemotron-3.5-asr-streaming-0.6b Automatic Speech Recognition • Updated 10 days ago • 56.4k • • 702
PaddlePaddle/PP-OCRv6_medium_rec_safetensors Image-to-Text • 19.2M • Updated 14 days ago • 1.41k • 20