Generate a talking face video from an image and audio
Generate spoken audio from text using selectable voices
Generate edited video frames using text prompts