https://github.com/bytedance/MegaTTS3


- models: the VAE encoder model, which enables custom voice cloning without .npy files.



```
└─wavvae
        config.yaml
        decoder.ckpt
        model_only_last.ckpt
```
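As a quick sanity check on the layout above, the following minimal sketch reports which of the three listed files are missing from a `wavvae` directory. The function name `missing_wavvae_files` and the `wavvae_dir` argument are hypothetical helpers introduced here for illustration; the listing does not say where the `wavvae` folder lives, so the caller passes the path.

```python
from pathlib import Path

# File names copied from the directory listing above.
EXPECTED_FILES = ("config.yaml", "decoder.ckpt", "model_only_last.ckpt")


def missing_wavvae_files(wavvae_dir):
    """Return the expected file names that are not present in wavvae_dir.

    Hypothetical helper: wavvae_dir is whatever path holds the wavvae
    folder on your machine; this document does not specify it.
    """
    root = Path(wavvae_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

An empty list means all three files were found.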
- script: `python generate_npy.py --input_wav assets/xxx.wav --output_npy assets/xxx.npy`
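To convert several recordings at once, the command above can be generated per file. This is a minimal sketch, assuming `generate_npy.py` accepts only the two flags shown above (`--input_wav`, `--output_npy`) and that each output should sit next to its input with a `.npy` suffix; `build_commands` is a hypothetical helper, not part of the repository.

```python
import sys
from pathlib import Path


def build_commands(wav_dir, script="generate_npy.py"):
    """Build one generate_npy.py command per .wav file in wav_dir.

    Hypothetical helper: it only assembles argument lists using the
    two flags shown in the command above; it does not run anything.
    """
    commands = []
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        npy = wav.with_suffix(".npy")  # assets/xxx.wav -> assets/xxx.npy
        commands.append(
            [sys.executable, script,
             "--input_wav", str(wav),
             "--output_npy", str(npy)]
        )
    return commands
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the directory that contains `generate_npy.py`.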