Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
k2-fsa
/
ZipVoice
like
44
Follow
k2-fsa
205
Text-to-Speech
ONNX
Safetensors
k2-fsa/OpenDialog
amphion/Emilia-Dataset
k2-fsa/TTS_eval_datasets
English
Chinese
arxiv:
2506.13053
arxiv:
2507.09318
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
12
refs/pr/7
ZipVoice
/
zipvoice
1.61 GB
5 contributors
History:
5 commits
zhu-han
Upload 7 files
7ca8bc5
verified
7 months ago
fm_decoder.onnx
477 MB
xet
Upload 16 files
7 months ago
fm_decoder_int8.onnx
124 MB
xet
Upload 16 files
7 months ago
model.json
697 Bytes
Upload 7 files
7 months ago
model.pt
491 MB
xet
Upload 16 files
7 months ago
model.safetensors
491 MB
xet
Upload 16 files
7 months ago
text_encoder.onnx
17.6 MB
xet
Upload 16 files
7 months ago
text_encoder_int8.onnx
5.54 MB
xet
Upload 16 files
7 months ago
tokens.txt
2.57 kB
Upload 16 files
7 months ago
zipvoice_base.json
697 Bytes
Upload 7 files
7 months ago