https://github.com/bytedance/MegaTTS3
- models
  - VAE encoder model, which enables custom voice cloning without .npy files
```
└─wavvae
    config.yaml
    decoder.ckpt
    model_only_last.ckpt
```
- script
  - python generate_npy.py --input_wav assets/xxx.wav --output_npy assets/xxx.npy
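
To convert several reference recordings at once, the command above can be wrapped in a small helper. The following is a minimal sketch, not part of this repository: the function names `missing_files`, `build_commands`, and `run_all` are hypothetical, the location of the `wavvae` directory is left to the caller, and it assumes `generate_npy.py` sits in the current working directory and accepts only the two flags shown above.

```python
import subprocess
import sys
from pathlib import Path

# File names taken from the wavvae listing above.
EXPECTED_FILES = ["config.yaml", "decoder.ckpt", "model_only_last.ckpt"]


def missing_files(wavvae_dir):
    """Return the expected model files that are absent from wavvae_dir."""
    root = Path(wavvae_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def build_commands(wav_dir, script="generate_npy.py"):
    """Build one generate_npy.py command per .wav file found in wav_dir.

    Each output path is the input path with its suffix changed to .npy,
    mirroring assets/xxx.wav -> assets/xxx.npy.
    """
    commands = []
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        npy = wav.with_suffix(".npy")
        commands.append([
            sys.executable, script,
            "--input_wav", str(wav),
            "--output_npy", str(npy),
        ])
    return commands


def run_all(wav_dir, wavvae_dir):
    """Check the model files, then run the script once per recording."""
    missing = missing_files(wavvae_dir)
    if missing:
        raise FileNotFoundError(f"missing model files: {missing}")
    for cmd in build_commands(wav_dir):
        subprocess.run(cmd, check=True)  # stop at the first failure
```

Calling `run_all("assets", <path to wavvae>)` would process every `.wav` file in `assets`; the second argument depends on where the model files were downloaded.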