Text-to-Video
worstcoder committed on
Commit 18554de · verified · 1 Parent(s): 4498433

Update README.md

Files changed (1)
  1. README.md +9 -2
README.md CHANGED
@@ -6,11 +6,12 @@ base_model:
  pipeline_tag: text-to-video
  ---
  # rCM: Score-Regularized Continuous-Time Consistency Model
+ # Causal-rCM: Teacher-Forcing meets Self-Forcing in Autoregressive Diffusion Distillation for Streaming Video Generation and Interactive World Models
  [**Paper**](https://arxiv.org/abs/2510.08431) | [**Website**](https://research.nvidia.com/labs/dir/rcm) | [**Code**](https://github.com/NVlabs/rcm)
 
- This repo holds converted Wan official checkpoints in rCM/TurboDiffusion style.
+ This repo holds converted Wan official checkpoints in (Causal-)rCM/TurboDiffusion style.
 
- Specifically, rCM equivalently replaces the Conv3d layer in the original Wan with a Linear layer for patch embedding, facilitating further optimization. The layer weight is directly reshaped without value change, e.g., from shape [5120, 16, 1, 2, 2] (Conv3d) to shape [5120, 64] (Linear).
+ Specifically, (Causal-)rCM equivalently replaces the `Conv3d` layer in the original Wan with a `Linear` layer for patch embedding, facilitating further optimization. The layer weight is directly reshaped without value change, e.g., from shape [5120, 16, 1, 2, 2] (Conv3d) to shape [5120, 64] (Linear).
 
  ## Citation
 
@@ -21,4 +22,10 @@ Specifically, rCM equivalently replaces the Conv3d layer in the original Wan wit
  journal={arXiv preprint arXiv:2510.08431},
  year={2025}
  }
+ @article{zheng2026causal,
+ title={Causal-rCM: Teacher-Forcing meets Self-Forcing in Autoregressive Diffusion Distillation for Streaming Video Generation and Interactive World Models},
+ author={Zheng, Kaiwen and He, Guande and Zhao, Min and Zhu, Hongzhou and Zhang, Jintao and Chen, Huayu and Chen, Jianfei and Lin, Chen-Hsuan and Liu, Ming-Yu and Zhu, Jun and Ma, Qianli},
+ journal={TODO},
+ year={2026}
+ }
  ```
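
Reviewer note on the `Conv3d` to `Linear` claim in the README text: the equivalence can be checked numerically. The sketch below is not taken from the rCM code base. It uses NumPy with small illustrative sizes (8 output channels instead of 5120), and it assumes the patch embedding uses stride equal to kernel size, no bias, and a flattened feature order of (channel, kt, kh, kw), which is what a plain reshape of the `Conv3d` weight implies. The helper names `conv3d_patch_embed` and `linear_patch_embed` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions). The README example uses out_ch=5120, in_ch=16.
out_ch, in_ch = 8, 16
kernel = (1, 2, 2)            # (kt, kh, kw); stride is assumed equal to the kernel
T, H, W = 3, 4, 6             # latent frames, height, width

conv_weight = rng.standard_normal((out_ch, in_ch, *kernel))
x = rng.standard_normal((in_ch, T, H, W))


def split_into_patches(x, kernel):
    """Reshape (c, T, H, W) into (c, T', kt, H', kh, W', kw) non-overlapping patches."""
    kt, kh, kw = kernel
    c, T, H, W = x.shape
    return x.reshape(c, T // kt, kt, H // kh, kh, W // kw, kw)


def conv3d_patch_embed(x, w):
    """Bias-free Conv3d with stride == kernel size, written as a contraction."""
    patches = split_into_patches(x, w.shape[2:])
    # Sum over input channel c and in-patch offsets (i, j, k).
    return np.einsum("ctihjwk,ocijk->othw", patches, w)


def linear_patch_embed(x, w2d, kernel):
    """Flatten each patch to a token, then apply a bias-free Linear layer."""
    patches = split_into_patches(x, kernel)
    c = x.shape[0]
    # Tokens ordered (T', H', W'); features ordered (c, kt, kh, kw).
    tokens = patches.transpose(1, 3, 5, 0, 2, 4, 6).reshape(-1, c * np.prod(kernel))
    return tokens @ w2d.T      # shape: (num_tokens, out_ch)


# The conversion described in the README: reshape only, values unchanged.
linear_weight = conv_weight.reshape(out_ch, -1)

conv_out = conv3d_patch_embed(x, conv_weight)            # (out_ch, T', H', W')
linear_out = linear_patch_embed(x, linear_weight, kernel)

print(linear_weight.shape)
print(np.allclose(conv_out.reshape(out_ch, -1).T, linear_out))
```

With 16 input channels and a (1, 2, 2) kernel, each patch flattens to 16 * 1 * 2 * 2 = 64 features, which matches the [5120, 64] shape quoted in the README. If the actual implementation flattens patches in a different feature order, the weight columns would need the same permutation.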