NumlockUknowSth commited on
Commit
68d6296
Β·
verified Β·
1 Parent(s): 02cc019

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +82 -0
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - NumlockUknowSth/Cine250K
5
+ language:
6
+ - en
7
+ pipeline_tag: text-to-video
8
+ tags:
9
+ - multi-shot
10
+ ---
11
+
12
+ <div align="center">
13
+
14
+ <h1>CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models</h1>
15
+
16
+ [![](https://img.shields.io/static/v1?label=CineTrans&message=Project&color=purple)](https://uknowsth.github.io/CineTrans/)   [![](https://img.shields.io/static/v1?label=Paper&message=Arxiv&color=red&logo=arxiv)](https://arxiv.org/abs/2508.11484)   [![](https://img.shields.io/static/v1?label=Code&message=Github&color=blue&logo=github)](https://github.com/Vchitect/CineTrans)   [![](https://img.shields.io/static/v1?label=Dataset&message=HuggingFace&color=yellow&logo=huggingface)](https://huggingface.co/datasets/NumlockUknowSth/Cine250K)   
17
+
18
+
19
+ <p><a href="https://scholar.google.com/citations?hl=zh-CN&user=TbZZSVgAAAAJ">Xiaoxue Wu</a><sup>1,2*</sup>,
20
+ <a href="https://scholar.google.com/citations?user=0gY2o7MAAAAJ&amp;hl=zh-CN" target="_blank">Bingjie Gao</a><sup>2,3</sup>,
21
+ <a href="https://scholar.google.com.hk/citations?user=gFtI-8QAAAAJ&amp;hl=zh-CN">Yu Qiao</a><sup>2&dagger;</sup>,
22
+ <a href="https://wyhsirius.github.io/">Yaohui Wang</a><sup>2&dagger;</sup>,
23
+ <a href="https://scholar.google.com/citations?user=3fWSC8YAAAAJ">Xinyuan Chen</a><sup>2&dagger;</sup></p>
24
+
25
+
26
+ <span class="author-block"><sup>1</sup>Fudan University</span>
27
+ <span class="author-block"><sup>2</sup>Shanghai Artificial Intelligence Laboratory</span>
28
+ <span class="author-block"><sup>3</sup>Shanghai Jiao Tong University</span>
29
+
30
+
31
+ <span class="author-block"><sup>*</sup>Work done during internship at Shanghai AI Laboratory</span> <span class="author-block"><sup>&dagger;</sup>Corresponding author</span>
32
+
33
+ </div>
34
+
35
+
36
+ ## πŸ“₯ Installation
37
+ 1. Clone the Repository
38
+ ```
39
+ git clone https://github.com/UknowSth/CineTrans.git
40
+ cd CineTrans
41
+ ```
42
+ 2. Set up Environment
43
+ ```
44
+ conda create -n cinetrans python==3.11.9
45
+ conda activate cinetrans
46
+
47
+ pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu118
48
+ pip install -r requirements.txt
49
+ ```
50
+
51
+ ## πŸ€— Checkpoint
52
+ ### CineTrans-Unet
53
+ Download the required [model weights](https://huggingface.co/NumlockUknowSth/CineTrans-Unet/tree/main) and place them in the `ckpt/` directory.
54
+ ```
55
+ ckpt/
56
+ │── stable-diffusion-v1-4/
57
+ β”‚ β”œβ”€β”€ scheduler/
58
+ β”‚ β”œβ”€β”€ text_encoder/
59
+ β”‚ β”œβ”€β”€ tokenizer/
60
+ β”‚ │── unet/
61
+ β”‚ └── vae_temporal_decoder/
62
+ │── checkpoint.pt
63
+ │── longclip-L.pt
64
+ ```
65
+
66
+ For more inference details, please refer to the [GitHub repository](https://github.com/Vchitect/CineTrans).
67
+
68
+ ---
69
+
70
+ ## πŸ“‘ BiTeX
71
+ If you find [CineTrans](https://github.com/Vchitect/CineTrans.git) useful for your research and applications, please cite using this BibTeX:
72
+ ```
73
+ @misc{wu2025cinetranslearninggeneratevideos,
74
+ title={CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models},
75
+ author={Xiaoxue Wu and Bingjie Gao and Yu Qiao and Yaohui Wang and Xinyuan Chen},
76
+ year={2025},
77
+ eprint={2508.11484},
78
+ archivePrefix={arXiv},
79
+ primaryClass={cs.CV},
80
+ url={https://arxiv.org/abs/2508.11484},
81
+ }
82
+ ```