Transcribe audio files or YouTube videos into text
Generate and convert voice using text and audio inputs