nielsr HF Staff commited on
Commit
ef96501
·
verified ·
1 Parent(s): 0c443f6

Improve model card with paper links, metadata and usage

Browse files

This PR enhances the model card for rCM. It adds:
- Links to the research paper, official project page, and GitHub repository.
- Descriptive metadata tags for better discoverability.
- A sample usage section based on the official inference script provided in the repository.
- A clear overview of the model's capabilities in few-step video generation.

Files changed (1) hide show
  1. README.md +26 -4
README.md CHANGED
@@ -1,20 +1,42 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - Wan-AI/Wan2.1-T2V-1.3B
5
  - Wan-AI/Wan2.1-T2V-14B
 
6
  pipeline_tag: text-to-video
 
 
 
 
 
7
  ---
 
8
  # rCM: Score-Regularized Continuous-Time Consistency Model
 
9
  [**Paper**](https://arxiv.org/abs/2510.08431) | [**Website**](https://research.nvidia.com/labs/dir/rcm) | [**Code**](https://github.com/NVlabs/rcm)
10
 
11
- This repo holds unofficial rCM models for Wan, reproduced by Tsinghua University.
12
 
13
- The Wan2.2 rCM checkpoints are obtained by merging Wan2.1 rCM weights to Wan2.2 checkpoints, no extra training included. Should have the same effect as directly using the Wan2.1 rCM LoRAs (https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM) and adjusting the strength.
14
 
15
- ## Citation
16
 
 
 
 
 
 
 
 
 
 
 
 
17
  ```
 
 
 
 
18
  @article{zheng2025rcm,
19
  title={Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency},
20
  author={Zheng, Kaiwen and Wang, Yuji and Ma, Qianli and Chen, Huayu and Zhang, Jintao and Balaji, Yogesh and Chen, Jianfei and Liu, Ming-Yu and Zhu, Jun and Zhang, Qinsheng},
 
1
  ---
 
2
  base_model:
3
  - Wan-AI/Wan2.1-T2V-1.3B
4
  - Wan-AI/Wan2.1-T2V-14B
5
+ license: apache-2.0
6
  pipeline_tag: text-to-video
7
+ tags:
8
+ - rcm
9
+ - consistency-models
10
+ - diffusion-distillation
11
+ - video-generation
12
  ---
13
+
14
  # rCM: Score-Regularized Continuous-Time Consistency Model
15
+
16
  [**Paper**](https://arxiv.org/abs/2510.08431) | [**Website**](https://research.nvidia.com/labs/dir/rcm) | [**Code**](https://github.com/NVlabs/rcm)
17
 
18
+ rCM is a framework for scaling up continuous-time consistency distillation to large-scale video diffusion models (up to 14B parameters). It enables high-fidelity video generation in only 1–4 steps, accelerating diffusion sampling by $15\times \sim 50\times$. This repository contains unofficial rCM models for Wan, reproduced by Tsinghua University.
19
 
20
+ The Wan2.2 rCM checkpoints are obtained by merging Wan2.1 rCM weights to Wan2.2 checkpoints, no extra training included. This should have the same effect as directly using the Wan2.1 rCM LoRAs and adjusting the strength.
21
 
22
+ ## Inference
23
 
24
+ To run the models, please refer to the environment setup in the [official GitHub repository](https://github.com/NVlabs/rcm).
25
+
26
+ Below is an example inference script for running rCM on T2V as found in the documentation:
27
+
28
+ ```bash
29
+ # Example for Wan2.1 T2V 1.3B
30
+ PYTHONPATH=. python rcm/inference/wan2pt1_t2v_rcm_infer.py \
31
+ --model_size 1.3B \
32
+ --dit_path assets/checkpoints/rCM_Wan2.1_T2V_1.3B_480p.pt \
33
+ --num_samples 5 \
34
+ --prompt "A cinematic shot of a snowy mountain at sunrise"
35
  ```
36
+
37
+ ## Citation
38
+
39
+ ```bibtex
40
  @article{zheng2025rcm,
41
  title={Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency},
42
  author={Zheng, Kaiwen and Wang, Yuji and Ma, Qianli and Chen, Huayu and Zhang, Jintao and Balaji, Yogesh and Chen, Jianfei and Liu, Ming-Yu and Zhu, Jun and Zhang, Qinsheng},