Generate cloned speech from text and reference audio
MegaTTS 3 but with voice cloning!
Generate realistic cloned speech from text and reference audio