High-quality image generation in 3 seconds
Generate speech from text using a reference voice
Convert text to speech