Generate a talking face video from an image and audio
Generate spoken audio from text in multiple languages
Generate edited video frames using text prompts