Voice Cloning

#7
by isaiahbjork - opened

I created a repo for you to clone your voice with CSM-1B, results aren't the best but you can recognize the cloned voice.

https://github.com/isaiahbjork/csm-voice-cloning

Hi, I am trying voice cloning using the repository https://github.com/isaiahbjork/csm-voice-cloning
. When I provide an audio input of around 3–4 minutes, I encounter the following error:
ValueError: Inputs too long, must be below max_seq_len - max_audio_frames: 1861.

Could you please help me understand how to provide longer audio inputs to achieve better voice cloning accuracy? Additionally, could you let me know which languages are currently supported by this voice cloning approach?

Sign up or log in to comment