Generate lip-synced video from audio and reference video
Create real-time lip-synchronized videos from audio