Generate a talking-face video from an image and an audio clip
Generate realistic audio from text
Display a loading screen with a spinner