Update README.md
Browse files
README.md
CHANGED
|
@@ -67,7 +67,7 @@ For VAE, the LightX2V team has conducted a series of deep optimizations, derivin
|
|
| 67 |
**Features**: Best Balanced Solution βοΈ
|
| 68 |
|
| 69 |
β
Uses **Causal 3D Conv** (same as official)
|
| 70 |
-
β
**Quality close to official**
|
| 71 |
β
Memory reduced by **~50%** (~4-5 GB)
|
| 72 |
β
Speed increased by **2-3x**
|
| 73 |
β
Balances quality, speed, and memory π
|
|
@@ -80,7 +80,7 @@ For VAE, the LightX2V team has conducted a series of deep optimizations, derivin
|
|
| 80 |
|
| 81 |
β
Minimal memory usage (~0.4 GB)
|
| 82 |
β
Extremely fast inference
|
| 83 |
-
β
**Quality close to official**
|
| 84 |
β
**Significantly surpasses open source TAE**
|
| 85 |
|
| 86 |
</td>
|
|
@@ -136,24 +136,23 @@ Model: seko-talk
|
|
| 136 |
<tr>
|
| 137 |
<td width="25%" align="center">
|
| 138 |
<strong>Wan2.1_VAE</strong><br>
|
| 139 |
-
<video controls autoplay width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/6l-P-3Hr9JKL3xgUyJXWJ.mp4"></video>
|
| 140 |
</td>
|
| 141 |
<td width="25%" align="center">
|
| 142 |
<strong>taew2_1</strong><br>
|
| 143 |
-
<video controls autoplay width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/rcVHrCKB4nRAs2VSjJd2d.mp4"></video>
|
| 144 |
</td>
|
| 145 |
<td width="25%" align="center">
|
| 146 |
<strong>lighttaew2_1</strong><br>
|
| 147 |
-
<video controls autoplay width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/Wq9p9Z7NDYwaKw4SqVbYT.mp4"></video>
|
| 148 |
</td>
|
| 149 |
<td width="25%" align="center">
|
| 150 |
<strong>lightvaew2_1</strong><br>
|
| 151 |
-
<video controls autoplay width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/NpKOzFcvsHzSFfFACzUKP.mp4"></video>
|
| 152 |
</td>
|
| 153 |
</tr>
|
| 154 |
</table>
|
| 155 |
|
| 156 |
-
|
| 157 |
## π Wan2.2 Series Performance Comparison
|
| 158 |
- **Precision**: BF16
|
| 159 |
- **Test Hardware**: NVIDIA H100
|
|
@@ -221,7 +220,7 @@ Model: [Wan2.2-TI2V-5B](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B)
|
|
| 221 |
- β
**Excellent quality**, close to official
|
| 222 |
- β
Memory reduced by **~50%** (~4-5 GB)
|
| 223 |
- β
Speed increased by **2-3x**
|
| 224 |
-
- β
**Close to official quality**
|
| 225 |
|
| 226 |
**Use Cases**: Daily production, strongly recommended β
|
| 227 |
|
|
@@ -234,7 +233,7 @@ Model: [Wan2.2-TI2V-5B](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B)
|
|
| 234 |
- β
Extremely low memory usage (~0.4 GB)
|
| 235 |
- β
Extremely fast inference
|
| 236 |
- β
**Quality significantly surpasses open source TAE**
|
| 237 |
-
- β
**Close to official quality**
|
| 238 |
|
| 239 |
**Use Cases**: Development testing, rapid iteration
|
| 240 |
|
|
@@ -250,11 +249,11 @@ Model: [Wan2.2-TI2V-5B](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B)
|
|
| 250 |
| **Architecture** | Conv2D | Conv2D | Causal Conv3D | Causal Conv3D |
|
| 251 |
| **Memory Usage** | Minimal (~0.4 GB) | Minimal (~0.4 GB) | Large (~8-12 GB) | Medium (~4-5 GB) |
|
| 252 |
| **Inference Speed** | Extremely Fast β‘β‘β‘β‘β‘ | Extremely Fast β‘β‘β‘β‘β‘ | Slow β‘β‘ | Fast β‘β‘β‘β‘ |
|
| 253 |
-
| **Generation Quality** | Average βββ | **Close to Official**
|
| 254 |
|
| 255 |
## π Todo List
|
| 256 |
- [x] LightX2V integration
|
| 257 |
-
- [
|
| 258 |
- [ ] Training & Distillation Code
|
| 259 |
|
| 260 |
## π Usage
|
|
@@ -397,6 +396,10 @@ cd LightX2V/scripts
|
|
| 397 |
bash wan/run_wan_i2v.sh # or other inference scripts
|
| 398 |
```
|
| 399 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 400 |
## β οΈ Important Notes
|
| 401 |
|
| 402 |
### 1. Compatibility
|
|
|
|
| 67 |
**Features**: Best Balanced Solution βοΈ
|
| 68 |
|
| 69 |
β
Uses **Causal 3D Conv** (same as official)
|
| 70 |
+
β
**Quality close to official** ββββ
|
| 71 |
β
Memory reduced by **~50%** (~4-5 GB)
|
| 72 |
β
Speed increased by **2-3x**
|
| 73 |
β
Balances quality, speed, and memory π
|
|
|
|
| 80 |
|
| 81 |
β
Minimal memory usage (~0.4 GB)
|
| 82 |
β
Extremely fast inference
|
| 83 |
+
β
**Quality close to official** ββββ
|
| 84 |
β
**Significantly surpasses open source TAE**
|
| 85 |
|
| 86 |
</td>
|
|
|
|
| 136 |
<tr>
|
| 137 |
<td width="25%" align="center">
|
| 138 |
<strong>Wan2.1_VAE</strong><br>
|
| 139 |
+
<video controls autoplay muted width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/6l-P-3Hr9JKL3xgUyJXWJ.mp4"></video>
|
| 140 |
</td>
|
| 141 |
<td width="25%" align="center">
|
| 142 |
<strong>taew2_1</strong><br>
|
| 143 |
+
<video controls autoplay muted width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/rcVHrCKB4nRAs2VSjJd2d.mp4"></video>
|
| 144 |
</td>
|
| 145 |
<td width="25%" align="center">
|
| 146 |
<strong>lighttaew2_1</strong><br>
|
| 147 |
+
<video controls autoplay muted width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/Wq9p9Z7NDYwaKw4SqVbYT.mp4"></video>
|
| 148 |
</td>
|
| 149 |
<td width="25%" align="center">
|
| 150 |
<strong>lightvaew2_1</strong><br>
|
| 151 |
+
<video controls autoplay muted width="100%" src="https://cdn-uploads.huggingface.co/production/uploads/680de13385293771bc57400b/NpKOzFcvsHzSFfFACzUKP.mp4"></video>
|
| 152 |
</td>
|
| 153 |
</tr>
|
| 154 |
</table>
|
| 155 |
|
|
|
|
| 156 |
## π Wan2.2 Series Performance Comparison
|
| 157 |
- **Precision**: BF16
|
| 158 |
- **Test Hardware**: NVIDIA H100
|
|
|
|
| 220 |
- β
**Excellent quality**, close to official
|
| 221 |
- β
Memory reduced by **~50%** (~4-5 GB)
|
| 222 |
- β
Speed increased by **2-3x**
|
| 223 |
+
- β
**Close to official quality** ββββ
|
| 224 |
|
| 225 |
**Use Cases**: Daily production, strongly recommended β
|
| 226 |
|
|
|
|
| 233 |
- β
Extremely low memory usage (~0.4 GB)
|
| 234 |
- β
Extremely fast inference
|
| 235 |
- β
**Quality significantly surpasses open source TAE**
|
| 236 |
+
- β
**Close to official quality** ββββ
|
| 237 |
|
| 238 |
**Use Cases**: Development testing, rapid iteration
|
| 239 |
|
|
|
|
| 249 |
| **Architecture** | Conv2D | Conv2D | Causal Conv3D | Causal Conv3D |
|
| 250 |
| **Memory Usage** | Minimal (~0.4 GB) | Minimal (~0.4 GB) | Large (~8-12 GB) | Medium (~4-5 GB) |
|
| 251 |
| **Inference Speed** | Extremely Fast β‘β‘β‘β‘β‘ | Extremely Fast β‘β‘β‘β‘β‘ | Slow β‘β‘ | Fast β‘β‘β‘β‘ |
|
| 252 |
+
| **Generation Quality** | Average βββ | **Close to Official** ββββ | Highest βββββ | **Close to Official** ββββ |
|
| 253 |
|
| 254 |
## π Todo List
|
| 255 |
- [x] LightX2V integration
|
| 256 |
+
- [x] ComfyUI integration
|
| 257 |
- [ ] Training & Distillation Code
|
| 258 |
|
| 259 |
## π Usage
|
|
|
|
| 396 |
bash wan/run_wan_i2v.sh # or other inference scripts
|
| 397 |
```
|
| 398 |
|
| 399 |
+
### Use in ComfyUI
|
| 400 |
+
|
| 401 |
+
please refer to https://github.com/ModelTC/ComfyUI-LightVAE
|
| 402 |
+
|
| 403 |
## β οΈ Important Notes
|
| 404 |
|
| 405 |
### 1. Compatibility
|