Generate speech from text using a reference voice
Generate spoken audio from text using selectable voices