StreamAvatar AROD Student Checkpoint

This repository hosts the public StreamAvatar AROD real-anchor student checkpoint.

AROD stands for Autoregressive One-step Denoising. It is a blockwise student model distilled from a DyStream teacher for faster audio-to-motion inference in the StreamAvatar project.

Files

  • blockwise_latest.pt: AROD real-anchor student checkpoint.
  • config.yaml: sanitized inference/training configuration for the checkpoint.

Download

pip install huggingface-hub

mkdir -p outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k
huggingface-cli download pancx/StreamAvatar-AROD blockwise_latest.pt \
  --local-dir outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k

Expected SHA256:

01893fabb842fcc8e9817a8e2530108d75932aad4f6ac4136e5c22b94702e860

Project

Code and full setup instructions are available at:

https://github.com/CXP-2024/StreamAvatar

The checkpoint requires the StreamAvatar/DyStream codebase, the original DyStream teacher checkpoint, Wav2Vec2 assets, and the renderer checkpoint described in the project README.

Downloads last month
11
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support