generate a video from an image with a text prompt
Generate animated talking portrait from image and audio