InsecureErasure
/

Z-Image-Turbo-NVFP4

Model card Files Files and versions

InsecureErasure commited on 4 days ago

Commit

513544a

·

verified ·

1 Parent(s): a3b00eb

Update README.md

Files changed (1) hide show

README.md +15 -15

README.md CHANGED Viewed

@@ -103,10 +103,10 @@ convert_to_quant -i $1 \
 | File | Description |
 |---|---|
-| `z_image_turbo_nvfp4_mixed.safetensors` | Quantized weights |
-| `z_image_turbo_nvfp4_mixed_lora.safetensors` | Error-correction LoRA (rank 32) |
-Use the LoRA at **1.5–2.0** strength in ComfyUI for maximum fidelity.
 ## Requirements
@@ -115,18 +115,18 @@ Use the LoRA at **1.5–2.0** strength in ComfyUI for maximum fidelity.
 ## Comparison
-| | NVFP4 Mixed (this) | [MXFP8 Uniform](https://huggingface.co/InsecureErasure/Z-Image-Turbo-MXFP8) | [Official NVFP4](https://huggingface.co/Comfy-Org/z_image_turbo) |
-|---|---|---|---|---:|
-| **Size** | 4.84 GB | 6.23 GB | 4.51 GB |
-| **Base format** | NVFP4 (4-bit) | MXFP8 (8-bit) | NVFP4 (4-bit) |
-| **Custom layers** | ~100 tensors → MXFP8 | None | None |
-| **BF16 exclusions** | ~20 surgical | 8 patterns | Refiners fully BF16 |
-| **Learned rounding** | ✅ 6000 iter | ❌ `--simple` | ❌ |
-| **LoRA** | ✅ rank 32 | ❌ | ❌ |
-| **Refiner block 0** | MXFP8 | MXFP8 | BF16 |
-| **Late adaLN (22–29)** | BF16 | BF16 | NVFP4 ⚠️ |
-| **Last QKV (layer 29)** | BF16 | BF16 | NVFP4 ⚠️ |
-| **Quantization time¹** | ~60–90 min | ~5–10 min | N/A |
 ¹ Estimated on RTX 5060 (Blackwell) with `comfy-kitchen` CUDA kernels.

 | File | Description |
 |---|---|
+| `z_image_turbo_nvfp4.safetensors` | Quantized weights |
+| `z_image_turbo_nvfp4_lora.safetensors` | Error-correction LoRA (rank 32) |
+Use the LoRA with variable strength in ComfyUI for improved fidelity.
 ## Requirements
 ## Comparison
+| | NVFP4 Mixed (this) | MXFP8 Uniform | Official NVFP4 |
+| --- | --- | --- | --- |
+| Size | 4.84 GB | 6.23 GB | 4.51 GB |
+| Base format | NVFP4 (4-bit) | MXFP8 (8-bit) | NVFP4 (4-bit) |
+| Custom layers | ~100 tensors → MXFP8 | None | None |
+| BF16 exclusions | ~20 tensors | 8 patterns | Refiners fully BF16 |
+| Learned rounding | ✅ 6000 iter | ❌ --simple | ❌ |
+| LoRA | ✅ rank 32 | ❌ | ❌ |
+| Refiner block 0 | MXFP8 | MXFP8 | BF16 |
+| Late adaLN (22–29) | BF16 | BF16 | NVFP4 ⚠️ |
+| Last QKV (layer 29) | BF16 | BF16 | NVFP4 ⚠️ |
+| Quantization time¹ | ~60–90 min | ~5–10 min | N/A |
 ¹ Estimated on RTX 5060 (Blackwell) with `comfy-kitchen` CUDA kernels.