htdong commited on
Commit
9459502
·
verified ·
1 Parent(s): aa898ee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +109 -1
README.md CHANGED
@@ -3,4 +3,112 @@ license: apache-2.0
3
  base_model:
4
  - Wan-AI/Wan2.1-T2V-14B
5
  pipeline_tag: text-to-video
6
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  base_model:
4
  - Wan-AI/Wan2.1-T2V-14B
5
  pipeline_tag: text-to-video
6
+ ---
7
+ <div align="center">
8
+
9
+ <h1>
10
+ Wan-Alpha
11
+ </h1>
12
+
13
+ <h3>Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel</h3>
14
+
15
+
16
+
17
+ [![arXiv](https://img.shields.io/badge/arXiv-xxxx-b31b1b)](https://arxiv.org/abs/)
18
+ [![Project Page](https://img.shields.io/badge/Project_Page-Link-green)](https://www.xxxx)
19
+ [![🤗 HuggingFace](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-Model-orange)](https://huggingface.co/xxxx)
20
+
21
+ </div>
22
+
23
+ <img src="assets/teaser.png" alt="Wan-Alpha Qualitative Results" style="max-width: 100%; height: auto;">
24
+
25
+ >Qualitative results of video generation using **Wan-Alpha**. Our model successfully generates various scenes with accurate and clearly rendered transparency. Notably, it can synthesize diverse semi-transparent objects, glowing effects, and fine-grained details such as hair.
26
+
27
+ ---
28
+
29
+ ## 🔥 News
30
+ * **[2025.09.30]** Released Wan-Alpha v1.0, the Wan2.1-14B-T2V–adapted weights and inference code are now open-sourced.
31
+
32
+ ---
33
+
34
+ ## 🌟 Showcase
35
+
36
+ ### Text-to-Video Generation with Alpha Channel
37
+ ## 🌟 Showcase
38
+
39
+ ### Text-to-Video Generation with Alpha Channel
40
+
41
+ <!-- | Prompt | Generated Video | Alpha Video |
42
+ | :---: | :---: | :---: |
43
+ | "Medium shot. A little girl holds a bubble wand and blows out colorful bubbles that float and pop in the air. The background of this video is transparent. Realistic style." |
44
+ <div style="display: flex; gap: 10px;">
45
+ <img src="girl.gif" alt="..." style="flex: 1; min-width: 200px;">
46
+ </div> |
47
+ <div style="display: flex; gap: 10px;">
48
+ <img src="girl_pha.gif" alt="..." style="flex: 1; min-width: 200px;">
49
+ </div> | -->
50
+ | Prompt | Generated Video | Alpha Video |
51
+ | :---: | :---: | :---: |
52
+ | "Medium shot. A little girl holds a bubble wand and blows out colorful bubbles that float and pop in the air. The background of this video is transparent. Realistic style." | <img src="assets/girl.gif" width="320" height="180" style="object-fit:contain; display:block; margin:auto;"/> | <img src="assets/girl_pha.gif" width="320" height="180" style="object-fit:contain; display:block; margin:auto;"/> |
53
+
54
+ ### For more results, please visit [https://Wan-Alpha.github.io/](https://www.xxx)
55
+
56
+ ## 🚀 Quick Start
57
+
58
+ ### 1. Environment Setup
59
+ ```bash
60
+ # Clone the project repository
61
+ git clone https://github.com/WeChatCV/Wan-Alpha.git
62
+ cd Wan-Alpha
63
+
64
+ # Create and activate Conda environment
65
+ conda create -n Wan-Alpha python=3.11 -y
66
+ conda activate Wan-Alpha
67
+
68
+ # Install dependencies
69
+ pip install -r requirements.txt
70
+ ```
71
+ ### 2. Model Download
72
+ Download [Wan2.1-T2V-14B](https://huggingface.co/Wan-AI/Wan2.1-T2V-14B)
73
+
74
+ Download [Lightx2v-T2V-14B](https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Lightx2v/lightx2v_T2V_14B_cfg_step_distill_v2_lora_rank64_bf16.safetensors)
75
+
76
+ Download [Wan-Alpha VAE] ()
77
+
78
+ Download [Wan-Alpha T2V] ()
79
+ ## 🧪 Usage
80
+ ```bash
81
+ bash test_lightx2v_dora.sh
82
+ ```
83
+ **Prompt Writing Tip:** You need to specify that the background of the video is transparent, the visual style, the shot type (such as close-up, medium shot, wide shot, or extreme close-up), and a description of the main subject. Prompts support both Chinese and English input.
84
+
85
+ ```bash
86
+ # An example of prompt.
87
+ This video has a transparent background. Close-up shot. A colorful parrot flying. Realistic style.
88
+ ```
89
+ ## 🤝 Acknowledgements
90
+
91
+ This project is built upon the following excellent open-source projects:
92
+ * [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio) (training/inference framework)
93
+ * [Wan2.1](https://github.com/Wan-Video/Wan2.1) (base video generation model)
94
+ * [LightX2V](https://github.com/ModelTC/LightX2V) (inference acceleration)
95
+ * [WanVideo_comfy](https://huggingface.co/Kijai/WanVideo_comfy) (inference acceleration)
96
+
97
+ We sincerely thank the authors and contributors of these projects.
98
+
99
+ ---
100
+
101
+ ## ✏ Citation
102
+
103
+ If you find our work helpful for your research, please consider citing our paper:
104
+
105
+ ```bibtex
106
+ @article{
107
+ }
108
+ ```
109
+
110
+ ---
111
+
112
+ ## 📬 Contact Us
113
+
114
+ If you have any questions or suggestions, feel free to reach out via [GitHub Issues](https://github.com/WeChatCV/Wan-Alpha/issues) . We look forward to your feedback!