metadata
license: other
tags:
  - audio-driven-animation
  - talking-head
  - streamavatar
  - dystream
  - ar-od
pipeline_tag: image-to-video

StreamAvatar AROD Student Checkpoint

This repository hosts the public StreamAvatar AROD real-anchor student checkpoint.

AROD stands for Autoregressive One-step Denoising. It is a blockwise student model distilled from a DyStream teacher for faster audio-to-motion inference in the StreamAvatar project.

Files

  • blockwise_latest.pt: AROD real-anchor student checkpoint.
  • config.yaml: sanitized inference/training configuration for the checkpoint.

Download

pip install huggingface-hub

mkdir -p outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k
huggingface-cli download pancx/StreamAvatar-AROD blockwise_latest.pt \
  --local-dir outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k

Expected SHA256:

01893fabb842fcc8e9817a8e2530108d75932aad4f6ac4136e5c22b94702e860
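
To check the download against the hash above, a minimal Python sketch along these lines may help. It assumes the hash refers to blockwise_latest.pt and that the file was saved to the --local-dir used in the download command. The helper names sha256_of_file and matches_expected are illustrative and not part of the StreamAvatar codebase.

```python
import hashlib
from pathlib import Path

# Hash published in this README; assumed here to belong to blockwise_latest.pt.
EXPECTED_SHA256 = "01893fabb842fcc8e9817a8e2530108d75932aad4f6ac4136e5c22b94702e860"

# Assumed location: the --local-dir from the download command above.
CHECKPOINT = Path(
    "outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k"
) / "blockwise_latest.pt"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 of a file, read in chunks to keep memory use low."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    if not CHECKPOINT.is_file():
        print(f"missing: {CHECKPOINT}")
    elif matches_expected(CHECKPOINT):
        print("checksum OK")
    else:
        print("checksum MISMATCH; consider re-downloading the file")
```

On Linux, running sha256sum on the downloaded file and comparing the output by eye gives the same check without Python.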

Project

Code and full setup instructions are available at:

https://github.com/CXP-2024/StreamAvatar

The checkpoint requires the StreamAvatar/DyStream codebase, the original DyStream teacher checkpoint, Wav2Vec2 assets, and the renderer checkpoint described in the project README.