Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Duplicated from
microsoft/VibeVoice-ASR
williamchangtw
/
VibeVoice-ASR
like
0
Automatic Speech Recognition
Transformers
Safetensors
VibeVoice
51 languages
ASR
Transcriptoin
Diarization
Speech-to-Text
arxiv:
2601.18184
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VibeVoice-ASR
/
figures
1.23 MB
2 contributors
History:
1 commit
williamchangtw
Duplicate from microsoft/VibeVoice-ASR
c1bd42e
about 23 hours ago
DER.jpg
Safe
62.7 kB
Duplicate from microsoft/VibeVoice-ASR
about 23 hours ago
VibeVoice_ASR_archi.png
Safe
149 kB
xet
Duplicate from microsoft/VibeVoice-ASR
about 23 hours ago
cpWER.jpg
Safe
68.5 kB
Duplicate from microsoft/VibeVoice-ASR
about 23 hours ago
language_distribution_horizontal.png
Safe
888 kB
xet
Duplicate from microsoft/VibeVoice-ASR
about 23 hours ago
tcpWER.jpg
Safe
64.3 kB
Duplicate from microsoft/VibeVoice-ASR
about 23 hours ago