Generate natural speech audio from text using a reference voice