Generate speech in a chosen voice from text and an audio prompt
Real-time in-browser speech recognition
Generate AI descriptions of live video or video files