|
|
--- |
|
|
title: MeiGen MultiTalk Demo |
|
|
emoji: 🎬 |
|
|
colorFrom: red |
|
|
colorTo: blue |
|
|
sdk: streamlit |
|
|
sdk_version: 1.28.1 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
license: apache-2.0 |
|
|
--- |
|
|
|
|
|
# MeiGen-MultiTalk Demo |
|
|
|
|
|
This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model. |
|
|
|
|
|
## Features |
|
|
|
|
|
- 💬 Generate videos of people talking from still images and audio |
|
|
- 👥 Support for both single-person and multi-person conversations |
|
|
- 🎯 High-quality lip synchronization |
|
|
- 📺 Support for 480p and 720p resolution |
|
|
- ⏱️ Generate videos up to 15 seconds long |
|
|
|
|
|
## How to Use |
|
|
|
|
|
1. Upload a reference image (photo of person(s) who will be speaking) |
|
|
2. Upload an audio file |
|
|
3. Enter a prompt describing the desired video |
|
|
4. Click "Generate Video" to process |
|
|
|
|
|
## Tips |
|
|
|
|
|
- Use clear, front-facing photos for best results |
|
|
- Ensure good audio quality without background noise |
|
|
- Keep prompts clear and specific |
|
|
- Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio |
|
|
|
|
|
## Limitations |
|
|
|
|
|
- Generation can take several minutes |
|
|
- Maximum video duration is 15 seconds |
|
|
- Best results with clear, well-lit reference images |
|
|
- Audio should be clear and without background noise |
|
|
|
|
|
## Credits |
|
|
|
|
|
This demo uses the MeiGen-MultiTalk model created by MeiGen-AI. |
|
|
|
|
|
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference |