update: 549 fp8 (Linear-only), Conv weights stay bf16
Browse files
README.md
CHANGED
|
@@ -91,8 +91,9 @@ native names, run a key remap (see `diffusers_to_native_wan` in
|
|
| 91 |
|
| 92 |
## What's in the file
|
| 93 |
|
| 94 |
-
- **
|
| 95 |
-
FFN projections, controlnet linears
|
|
|
|
| 96 |
- **1800 BF16 tensors** (5.84 GB):
|
| 97 |
- 362 `norm` tensors (RMSNorm gains): 1.26 GB
|
| 98 |
- 80 `add_*` tensors (I2V cross-attention to image embeddings): 4.19 GB
|
|
|
|
| 91 |
|
| 92 |
## What's in the file
|
| 93 |
|
| 94 |
+
- **549 FP8 tensors** (14.5 GB): rank-2 Linear weights only (self-attn
|
| 95 |
+
Q/K/V/O, FFN projections, controlnet linears). Conv weights stay BF16:
|
| 96 |
+
no scaled-fp8 Conv kernel exists in PyTorch / ComfyUI
|
| 97 |
- **1800 BF16 tensors** (5.84 GB):
|
| 98 |
- 362 `norm` tensors (RMSNorm gains): 1.26 GB
|
| 99 |
- 80 `add_*` tensors (I2V cross-attention to image embeddings): 4.19 GB
|