KDTalker
π
154
Generate a talking-head video from a face image and audio
Extract text from images using various OCR modes
Interact with a chatbot that understands text and images
Generate speech from text using a reference voice
Generate customized captions for any image
Ultra-high resolution image synthesis