metadata

title: InfiniteTalk - Talking Video Generator
emoji: 🎬
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.6.0
python_version: '3.10'
app_file: app.py
pinned: false
license: apache-2.0
short_description: AI talking video generator with accurate lip-sync

InfiniteTalk - Talking Video Generator

Generate realistic talking-head videos with accurate lip-sync from a still image, or dub an existing video with new audio.

Features

Image-to-Video: Transform a static portrait image into a talking video using audio input
Video Dubbing: Re-sync an existing video with new audio while maintaining natural head movements and expressions
High Quality: 480p and 720p output resolutions with accurate lip-sync
Unlimited Length: Videos of any duration are supported through chunked processing
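
To illustrate what chunked processing can look like, here is a minimal sketch that splits a long frame range into overlapping windows. It is not taken from the InfiniteTalk code: the function name `plan_chunks` and the default values for `chunk_len` and `overlap` are assumptions chosen for the example.

```python
from typing import List, Tuple


def plan_chunks(
    total_frames: int,
    chunk_len: int = 81,   # hypothetical window size, not an InfiniteTalk constant
    overlap: int = 9,      # hypothetical number of frames shared between windows
) -> List[Tuple[int, int]]:
    """Split the range [0, total_frames) into overlapping (start, end) windows."""
    if total_frames <= 0:
        return []
    if not 0 <= overlap < chunk_len:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_len")

    stride = chunk_len - overlap  # how far each new window advances
    chunks: List[Tuple[int, int]] = []
    start = 0
    while True:
        end = min(start + chunk_len, total_frames)
        chunks.append((start, end))
        if end == total_frames:  # the last window reaches the final frame
            break
        start += stride
    return chunks


if __name__ == "__main__":
    print(plan_chunks(200))  # [(0, 81), (72, 153), (144, 200)]
```

Each window can then be generated separately, with the overlapping frames available to keep motion continuous across window boundaries. How the real model conditions on the overlap is not described in this README.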

How It Works

InfiniteTalk combines the Wan2.1 diffusion model with specialized audio conditioning to create photorealistic talking videos. The system synchronizes:

Lip movements with audio
Head pose and rotations
Facial expressions
Body posture

Usage

Image-to-Video

Upload a portrait image (clear face visibility recommended)
Upload an audio file or use the example
Adjust parameters if needed
Click Generate

Video Dubbing

Upload a video with a visible face
Upload new audio to dub over it
Adjust parameters if needed
Click Generate

Parameters

Resolution: Choose between 480p (faster) and 720p (higher quality)
Diffusion Steps: More steps give higher quality but take longer (20-50 recommended)
Audio Guide Scale: Controls how strongly the audio influences generation (2-4 recommended)
Seed: Set a fixed seed to get reproducible results
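
The recommended ranges above can be captured in a small settings object. The following is a sketch only: the class `GenerationParams`, its field names, and its default values are hypothetical and do not reflect the Space's actual `app.py`. Only the allowed resolutions and the recommended ranges come from this README.

```python
from dataclasses import dataclass
from typing import List

RESOLUTIONS = ("480p", "720p")         # from the README
RECOMMENDED_STEPS = (20, 50)           # from the README
RECOMMENDED_AUDIO_SCALE = (2.0, 4.0)   # from the README


@dataclass(frozen=True)
class GenerationParams:
    """Hypothetical container for the four parameters listed above."""

    resolution: str = "480p"        # assumed default
    diffusion_steps: int = 30       # assumed default
    audio_guide_scale: float = 3.0  # assumed default
    seed: int = 0                   # assumed default

    def __post_init__(self) -> None:
        # Hard errors: values that cannot be valid at all.
        if self.resolution not in RESOLUTIONS:
            raise ValueError(f"resolution must be one of {RESOLUTIONS}")
        if self.diffusion_steps < 1:
            raise ValueError("diffusion_steps must be at least 1")

    def warnings(self) -> List[str]:
        """Return notes for values outside the recommended ranges."""
        notes: List[str] = []
        lo, hi = RECOMMENDED_STEPS
        if not lo <= self.diffusion_steps <= hi:
            notes.append(f"diffusion_steps outside recommended {lo}-{hi}")
        lo_s, hi_s = RECOMMENDED_AUDIO_SCALE
        if not lo_s <= self.audio_guide_scale <= hi_s:
            notes.append(f"audio_guide_scale outside recommended {lo_s:g}-{hi_s:g}")
        return notes


if __name__ == "__main__":
    params = GenerationParams(resolution="720p", diffusion_steps=10)
    print(params.warnings())
```

Separating hard errors from soft warnings keeps out-of-range but still usable settings available for experimentation.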

Credits

Built on InfiniteTalk by MeiGen-AI.

License

Apache 2.0 - See LICENSE.txt for details