https://github.com/bytedance/MegaTTS3


- models

For the VAE encoder model, which enables custom voice cloning without .npy files, the `wavvae` folder should look like this:

```
└─wavvae
        config.yaml
        decoder.ckpt
        model_only_last.ckpt
```
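
The layout above can be checked before running anything. The following is a minimal sketch, not part of the repository: the helper name `missing_wavvae_files` and the `checkpoints/wavvae` path are assumptions, and only the three file names come from the tree above.

```python
from pathlib import Path

# File names taken from the tree above.
EXPECTED_FILES = ("config.yaml", "decoder.ckpt", "model_only_last.ckpt")


def missing_wavvae_files(wavvae_dir):
    """Return the expected file names that are absent from wavvae_dir."""
    root = Path(wavvae_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    # "checkpoints/wavvae" is a hypothetical location; adjust it to your checkout.
    missing = missing_wavvae_files("checkpoints/wavvae")
    print("missing:", missing if missing else "none")
```

An empty list means all three files are in place.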
- script

Generate an .npy file from a WAV file:

`python generate_npy.py --input_wav assets/xxx.wav --output_npy assets/xxx.npy`
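
To process several WAV files at once, the same command can be built once per file. This is a rough sketch under assumptions: the helper `build_commands` is hypothetical, it assumes `generate_npy.py` is in the current directory, and it uses only the `--input_wav` and `--output_npy` flags shown above. It prints the commands rather than running them.

```python
import sys
from pathlib import Path


def build_commands(wav_dir, script="generate_npy.py"):
    """Build one generate_npy.py command per .wav file found in wav_dir."""
    commands = []
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        # Write the .npy next to the .wav, with the same base name.
        npy = wav.with_suffix(".npy")
        commands.append(
            [sys.executable, script, "--input_wav", str(wav), "--output_npy", str(npy)]
        )
    return commands


if __name__ == "__main__":
    # "assets" matches the folder used in the command above.
    for cmd in build_commands("assets"):
        print(" ".join(cmd))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` once the printed commands look right.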