--- license: other tags: - audio-driven-animation - talking-head - streamavatar - dystream - ar-od pipeline_tag: image-to-video --- # StreamAvatar AROD Student Checkpoint This repository hosts the public StreamAvatar AROD real-anchor student checkpoint. AROD stands for Autoregressive One-step Denoising. It is a blockwise student model distilled from a DyStream teacher for faster audio-to-motion inference in the StreamAvatar project. ## Files - `blockwise_latest.pt`: AROD real-anchor student checkpoint. - `config.yaml`: sanitized inference/training configuration for the checkpoint. ## Download ```bash pip install huggingface-hub mkdir -p outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k huggingface-cli download pancx/StreamAvatar-AROD blockwise_latest.pt \ --local-dir outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k ``` Expected SHA256: ```text 01893fabb842fcc8e9817a8e2530108d75932aad4f6ac4136e5c22b94702e860 ``` ## Project Code and full setup instructions are available at: ```text https://github.com/CXP-2024/StreamAvatar ``` The checkpoint requires the StreamAvatar/DyStream codebase, the original DyStream teacher checkpoint, Wav2Vec2 assets, and the renderer checkpoint described in the project README.