---
license: mit
---


<div align="center">
  
<h1>ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions</h1>

[![](https://img.shields.io/static/v1?label=ShotDirector&message=Project&color=purple)](https://github.com/UknowSth/ShotDirector)   [![](https://img.shields.io/static/v1?label=Paper&message=Arxiv&color=red&logo=arxiv)](https://arxiv.org/abs/2512.10286)   [![](https://img.shields.io/static/v1?label=Code&message=Github&color=blue&logo=github)](https://github.com/UknowSth/ShotDirector)    

                    
<p><a href="https://scholar.google.com/citations?hl=zh-CN&user=TbZZSVgAAAAJ">Xiaoxue Wu</a><sup>1,2*</sup>,
<a href="https://scholar.google.com/citations?user=3fWSC8YAAAAJ">Xinyuan Chen</a><sup>2&dagger;</sup>,
<a href="https://wyhsirius.github.io/">Yaohui Wang</a><sup>2&dagger;</sup>,
<a href="https://scholar.google.com.hk/citations?user=gFtI-8QAAAAJ&amp;hl=zh-CN">Yu Qiao</a><sup>2&dagger;</sup>
</p>


<span class="author-block"><sup>1</sup>Fudan University</span>
<span class="author-block"><sup>2</sup>Shanghai Artificial Intelligence Laboratory</span>


<span class="author-block"><sup>*</sup>Work done during internship at Shanghai AI Laboratory</span> <span class="author-block"><sup>&dagger;</sup>Corresponding author</span>

</div>

## 📥 Installation
1. Clone the Repository
```
git clone https://github.com/UknowSth/ShotDirector.git
cd ShotDirector
```
2. Set up Environment
```
conda create -n shotdirector python==3.11.9
conda activate shotdirector

pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
```
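
After installation, it can be useful to confirm that the pinned PyTorch packages actually landed in the active environment. The following is a minimal sketch, not part of the ShotDirector repository: the helper name `find_version_mismatches` is hypothetical, and it only checks the two versions pinned in the commands above, using the standard library.

```python
from importlib import metadata

# Versions pinned by the install commands above.
EXPECTED = {"torch": "2.5.1", "torchvision": "0.20.1"}


def find_version_mismatches(expected):
    """Return {package: (expected, installed)} for packages that are missing or differ.

    Hypothetical helper for illustration; it is not part of ShotDirector.
    """
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Wheels from the cu118 index carry a local tag such as "2.5.1+cu118",
        # so only the public part of the version is compared.
        public = installed.split("+")[0] if installed else None
        if public != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    problems = find_version_mismatches(EXPECTED)
    for name, (wanted, installed) in problems.items():
        print(f"{name}: expected {wanted}, found {installed or 'not installed'}")
    if not problems:
        print("torch and torchvision match the pinned versions")
```

Run it inside the activated `shotdirector` environment; any line it prints about a mismatch suggests the `pip install` step should be repeated.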

## 🤗 Checkpoint

### CineTrans-DiT
Download the [Wan2.1-T2V-1.3B](https://huggingface.co/Wan-AI/Wan2.1-T2V-1.3B) weights and the weights required for ShotDirector, then place them in the `ckpt/` folder following the directory tree below.

```
ckpt/
├── Wan2.1/Wan2.1-T2V-1.3B/
│   ├── config.json
│   ├── diffusion_pytorch_model.safetensors
│   ├── google/
│   ├── models_t5_umt5-xxl-enc-bf16.pth
│   └── Wan2.1_VAE.pth
├── encoder.pt
├── model.pt
└── trans.pt
```

For more inference details, please refer to our [GitHub repository](https://github.com/UknowSth/ShotDirector).

## 📑 BibTeX
If you find [ShotDirector](https://github.com/UknowSth/ShotDirector.git) useful for your research and applications, please cite using this BibTeX:
```
@misc{wu2025shotdirectordirectoriallycontrollablemultishot,
      title={ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions}, 
      author={Xiaoxue Wu and Xinyuan Chen and Yaohui Wang and Yu Qiao},
      year={2025},
      eprint={2512.10286},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.10286}, 
}
```