Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VibeVoice-ASR-HF
like
6
Follow
Microsoft
18.6k
Audio-Text-to-Text
Transformers
Safetensors
51 languages
vibevoice_asr
automatic-speech-recognition
ASR
Diarization
Speech-to-Text
Transcription
arxiv:
2601.18184
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
VibeVoice-ASR-HF
16.7 GB
1 contributor
History:
2 commits
frontierai
Initial commit
8623f43
verified
about 16 hours ago
figures
Initial commit
about 16 hours ago
.gitattributes
1.72 kB
Initial commit
about 16 hours ago
README.md
22 kB
Initial commit
about 16 hours ago
chat_template.jinja
1.24 kB
Initial commit
about 16 hours ago
config.json
2.92 kB
Initial commit
about 16 hours ago
generation_config.json
281 Bytes
Initial commit
about 16 hours ago
model-00001-of-00008.safetensors
2.49 GB
xet
Initial commit
about 16 hours ago
model-00002-of-00008.safetensors
2.39 GB
xet
Initial commit
about 16 hours ago
model-00003-of-00008.safetensors
2.47 GB
xet
Initial commit
about 16 hours ago
model-00004-of-00008.safetensors
2.47 GB
xet
Initial commit
about 16 hours ago
model-00005-of-00008.safetensors
2.5 GB
xet
Initial commit
about 16 hours ago
model-00006-of-00008.safetensors
1.83 GB
xet
Initial commit
about 16 hours ago
model-00007-of-00008.safetensors
2.48 GB
xet
Initial commit
about 16 hours ago
model-00008-of-00008.safetensors
37.3 MB
xet
Initial commit
about 16 hours ago
model.safetensors.index.json
92 kB
Initial commit
about 16 hours ago
processor_config.json
537 Bytes
Initial commit
about 16 hours ago
tokenizer.json
11.4 MB
xet
Initial commit
about 16 hours ago
tokenizer_config.json
714 Bytes
Initial commit
about 16 hours ago