Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
drbaph
/
MegaTTS3-WaveVAE
like
5
Text-to-Speech
Transformers
Safetensors
PyTorch
tts
voice-cloning
speech-synthesis
audio
chinese
english
zero-shot
diffusion
arxiv:
2502.18924
arxiv:
2408.16532
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MegaTTS3-WaveVAE
/
diffusion_transformer
Commit History
Upload 24 files
4bd73f1
verified
drbaph
commited on
Jul 22, 2025