jackyhate committed on
Commit
c483c3a
·
verified ·
1 Parent(s): f327b95

Update README.md

  1. README.md +7 -11
README.md CHANGED
@@ -1,21 +1,17 @@
  <p align="center">
  <h1 align="center">HiAR</h1>
  <h3 align="center">Hierarchical Autoregressive Video Generation with Pipelined Parallel Inference</h3>
  </p>
  <p align="center">
- <h3 align="center"><a href="#">Paper</a> | <a href="#">Website</a> | <a href="#">Models (HuggingFace)</a></h3>
  </p>
 
  ---
 
  HiAR proposes **hierarchical denoising** for autoregressive video diffusion models, a paradigm shift from conventional block-first to **step-first** denoising order. By conditioning each block on context at a matched noise level, HiAR maximally attenuates error propagation while preserving temporal causality, achieving **state-of-the-art long video generation** (20s+) with significantly reduced quality drift.
-
- Key features:
- - **Hierarchical Denoising**: Step-first denoising order with noisy context conditioning at matched noise levels
- - **Pipelined Parallel Inference**: Exploits the hierarchical structure for wall-clock speedup via multi-GPU pipeline parallelism
- - **Forward-KL Regularization**: Prevents low-motion shortcuts in reverse-KL distillation
- - **4-step generation**: Real-time streaming video generation on a single GPU
-
- ---
- license: mit
- ---
 
+ ---
+ license: mit
+ base_model:
+ - Wan-AI/Wan2.1-T2V-1.3B
+ pipeline_tag: text-to-video
+ ---
  <p align="center">
  <h1 align="center">HiAR</h1>
  <h3 align="center">Hierarchical Autoregressive Video Generation with Pipelined Parallel Inference</h3>
  </p>
  <p align="center">
+ <h3 align="center"><a href="#">Paper</a> | <a href="#">Website</a> </h3>
  </p>
 
  ---
 
  HiAR proposes **hierarchical denoising** for autoregressive video diffusion models, a paradigm shift from conventional block-first to **step-first** denoising order. By conditioning each block on context at a matched noise level, HiAR maximally attenuates error propagation while preserving temporal causality, achieving **state-of-the-art long video generation** (20s+) with significantly reduced quality drift.
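
The difference between block-first and step-first ordering may be easier to see as two loop nests over a grid of (block, step) cells. The sketch below is illustrative only and is not taken from the HiAR code: the function names `block_first_order`, `step_first_order`, and `wavefronts` are hypothetical, and the dependency rule used in `wavefronts` (a cell waits only on the previous block at the same step and on the same block at the previous step) is an assumption inferred from the description above, not a confirmed detail of the method.

```python
from itertools import product


def block_first_order(num_blocks, num_steps):
    """Conventional order: run every denoising step of a block, then move on."""
    return [(b, s) for b in range(num_blocks) for s in range(num_steps)]


def step_first_order(num_blocks, num_steps):
    """Step-first order: run one denoising step over all blocks (in causal
    block order) before starting the next step."""
    return [(b, s) for s in range(num_steps) for b in range(num_blocks)]


def wavefronts(num_blocks, num_steps):
    """Group cells by b + s.

    ASSUMPTION (not from the source): cell (b, s) depends only on
    (b - 1, s) and (b, s - 1). Under that rule, all cells in the same
    group are independent and could run concurrently.
    """
    fronts = [[] for _ in range(num_blocks + num_steps - 1)]
    for b, s in product(range(num_blocks), range(num_steps)):
        fronts[b + s].append((b, s))
    return fronts


if __name__ == "__main__":
    print(block_first_order(2, 2))
    print(step_first_order(2, 2))
    print(wavefronts(2, 2))
```

Under the stated assumption, a grid of `num_blocks` by `num_steps` cells needs `num_blocks + num_steps - 1` sequential groups rather than `num_blocks * num_steps` sequential cells, which is one way to read the "pipelined parallel inference" in the title.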