PersonaLive!

Expressive Portrait Image Animation for Live Streaming

GitHub

Zhiyuan Li1,2,3 Β· Chi-Man Pun1,πŸ“ͺ Β· Chen Fang2 Β· Jue Wang2 Β· Xiaodong Cun3,πŸ“ͺ

1 University of Macau    2 Dzine.ai    3 GVC Lab, Great Bay University

⚑️ Real-time, Streamable, Infinite-Length ⚑️
⚑️ Portrait Animation requires only ~12GB VRAM ⚑️

πŸ“‹ TODO

  • If you find PersonaLive useful or interesting, please give us a Star 🌟 on our GitHub repo! Your support drives us to keep improving. 🍻
  • Fix bugs (If you encounter any issues, please feel free to open an issue or contact me! πŸ™)
  • Enhance WebUI (Support reference image replacement).
  • [2025.12.17] πŸ”₯ ComfyUI-PersonaLive is now supported! (Thanks to @okdalto)
  • [2025.12.15] πŸ”₯ Release paper!
  • [2025.12.12] πŸ”₯ Release inference code, config, and pretrained weights!

βš™οΈ Framework

Image 1

We present PersonaLive, a real-time and streamable diffusion framework capable of generating infinite-length portrait animations on a single 12GB GPU.

πŸš€ Getting Started

πŸ›  Installation

# clone this repo
git clone https://github.com/GVCLab/PersonaLive
cd PersonaLive

# Create conda environment
conda create -n personalive python=3.10
conda activate personalive

# Install packages with pip
pip install -r requirements_base.txt

⏬ Download weights

Option 1: Download pre-trained weights of base models and other components (sd-image-variations-diffusers and sd-vae-ft-mse). You can run the following command to download weights automatically:

python tools/download_weights.py

Option 2: Download pre-trained weights into the ./pretrained_weights folder from one of the below URLs:

Finally, these weights should be organized as follows:

pretrained_weights
β”œβ”€β”€ onnx
β”‚   β”œβ”€β”€ unet_opt
β”‚   β”‚   β”œβ”€β”€ unet_opt.onnx
β”‚   β”‚   └── unet_opt.onnx.data
β”‚   └── unet
β”œβ”€β”€ personalive
β”‚   β”œβ”€β”€ denoising_unet.pth
β”‚   β”œβ”€β”€ motion_encoder.pth
β”‚   β”œβ”€β”€ motion_extractor.pth
β”‚   β”œβ”€β”€ pose_guider.pth
β”‚   β”œβ”€β”€ reference_unet.pth
β”‚   └── temporal_module.pth
β”œβ”€β”€ sd-vae-ft-mse
β”‚   β”œβ”€β”€ diffusion_pytorch_model.bin
β”‚   └── config.json
β”œβ”€β”€ sd-image-variations-diffusers
β”‚   β”œβ”€β”€ image_encoder
β”‚   β”‚   β”œβ”€β”€ pytorch_model.bin
β”‚   β”‚   └── config.json
β”‚   β”œβ”€β”€ unet
β”‚   β”‚   β”œβ”€β”€ diffusion_pytorch_model.bin
β”‚   β”‚   └── config.json
β”‚   └── model_index.json
└── tensorrt
    └── unet_work.engine

🎞️ Offline Inference

python inference_offline.py

⚠️ Note for RTX 50-Series (Blackwell) Users: xformers is not yet fully compatible with the new architecture. To avoid crashes, please disable it by running:

python inference_offline.py --use_xformers False

πŸ“Έ Online Inference

πŸ“¦ Setup Web UI

# install Node.js 18+
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.39.1/install.sh | bash
nvm install 18

cd webcam
source start.sh

🏎️ Acceleration (Optional)

Converting the model to TensorRT can significantly speed up inference (~ 2x ⚑️). Building the engine may take about 20 minutes depending on your device. Note that TensorRT optimizations may lead to slight variations or a small drop in output quality.

pip install -r requirement_trt.txt

python torch2trt.py

The provided TensorRT model is from an H100. We recommend ALL users (including H100 users) re-run python torch2trt.py locally to ensure best compatibility.

▢️ Start Streaming

python inference_online.py --acceleration none (for RTX 50-Series) or xformers or tensorrt

Then open http://0.0.0.0:7860 in your browser. (*If http://0.0.0.0:7860 does not work well, try http://localhost:7860)

How to use: Upload Image ➑️ Fuse Reference ➑️ Start Animation ➑️ Enjoy! πŸŽ‰

PersonaLive

Regarding Latency: Latency varies depending on your device's computing power. You can try the following methods to optimize it:

  1. Lower the "Driving FPS" setting in the WebUI to reduce the computational workload.
  2. You can increase the multiplier (e.g., set to num_frames_needed * 4 or higher) to better match your device's inference speed. https://github.com/GVCLab/PersonaLive/blob/6953d1a8b409f360a3ee1d7325093622b29f1e22/webcam/util.py#L73

πŸ“š Community Contribution

Special thanks to the community for providing helpful setups! πŸ₯‚

⭐ Citation

If you find PersonaLive useful for your research, welcome to cite our work using the following BibTeX:

@article{li2025personalive,
  title={PersonaLive! Expressive Portrait Image Animation for Live Streaming},
  author={Li, Zhiyuan and Pun, Chi-Man and Fang, Chen and Wang, Jue and Cun, Xiaodong},
  journal={arXiv preprint arXiv:2512.11253},
  year={2025}
}

❀️ Acknowledgement

This code is mainly built upon Moore-AnimateAnyone, X-NeMo, StreamDiffusion, RAIN and LivePortrait, thanks to their invaluable contributions.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support