https://github.com/bytedance/MegaTTS3 - models - For the VAE encoder model, which enables custom voice cloning without .npy files ``` └─wavvae config.yaml decoder.ckpt model_only_last.ckpt ``` - script - python generate_npy.py --input_wav assets/xxx.wav --output_npy assets/xxx.npy