Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
drbaph
/
MegaTTS3-WaveVAE
like
5
Text-to-Speech
Transformers
Safetensors
PyTorch
tts
voice-cloning
speech-synthesis
audio
chinese
english
zero-shot
diffusion
arxiv:
2502.18924
arxiv:
2408.16532
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MegaTTS3-WaveVAE
File size: 72 Bytes
4bd73f1
1
{
"framework"
:
"pytorch"
,
"task"
:
"text-to-speech"
,
"allow_remote"
:
true
}