Update README.md
Browse files
README.md
CHANGED
|
@@ -33,4 +33,34 @@ Negative embeddings files: BadDream, UnrealisticDream
|
|
| 33 |
</p>
|
| 34 |
</td>
|
| 35 |
</tr>
|
| 36 |
-
</table>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
</p>
|
| 34 |
</td>
|
| 35 |
</tr>
|
| 36 |
+
</table>
|
| 37 |
+
|
| 38 |
+
|
| 39 |
+
# Hybrid-Sensitivity-Weighted-Quantization (HSWQ)
|
| 40 |
+
|
| 41 |
+
<p align="center">
|
| 42 |
+
<img src="https://raw.githubusercontent.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/main/icon.png" width="128">
|
| 43 |
+
</p>
|
| 44 |
+
|
| 45 |
+
High-fidelity FP8 quantization for diffusion models (SDXL). HSWQ uses **sensitivity** and **importance** analysis instead of naive uniform cast, and offers two modes: standard-compatible (V1) and high-performance scaled (V2).
|
| 46 |
+
|
| 47 |
+
**Technical details:** [md/HSWQ_ Hybrid Sensitivity Weighted Quantization.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/md/HSWQ_%20Hybrid%20Sensitivity%20Weighted%20Quantization.md)
|
| 48 |
+
|
| 49 |
+
**How to quantize:** [md/HSWQ_ How to quantize SDXL.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/md/How%20to%20quantize%20SDXL.md)
|
| 50 |
+
|
| 51 |
+
**SDXL Benchmark Test Results:** [md/SDXL Benchmark Test Results.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/test/benchmark_test.md)
|
| 52 |
+
|
| 53 |
+
|
| 54 |
+
|
| 55 |
+
|
| 56 |
+
|
| 57 |
+
# Credit & Special Acknowledgement
|
| 58 |
+
|
| 59 |
+
[https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization)
|
| 60 |
+
|
| 61 |
+
[https://github.com/tritant/ComfyUI_Kitchen_nvfp4_Converter](https://github.com/tritant/ComfyUI_Kitchen_nvfp4_Converter)
|
| 62 |
+
|
| 63 |
+
[https://github.com/NVIDIA/Model-Optimizer](https://github.com/NVIDIA/Model-Optimizer)
|
| 64 |
+
|
| 65 |
+
We extend our deepest respect and gratitude to the **Nunchaku Team** for their groundbreaking work on SVDQ quantization and for sharing their models with the community. This collection relies heavily on their research and original implementation.
|
| 66 |
+
- **Original Repository:** [nunchaku-tech/nunchaku-sdxl](https://huggingface.co/nunchaku-tech/nunchaku-sdxl)
|