ApacheOne
/

HSWQ-fp8-SDXL

Model card Files Files and versions

HSWQ-fp8-SDXL / README.md

ApacheOne's picture

Update README.md

5c4e6e8 verified about 1 month ago

|

history blame contribute delete

2.81 kB

	---
	license: agpl-3.0
	---

	# Model info

	Creator: [https://civitai.com/user/jice](https://civitai.com/user/jice)

	[https://civitai.com/models/383364?modelVersionId=471056](https://civitai.com/models/383364?modelVersionId=471056)

	`creapromptLightning_creapromtHypersdxlV1_r32_r0.1_HSWQ_fp8e4m3.safetensors` full model

	`creapromptLightning_creapromtHypersdxlV1_r32_r0.1_HSWQ_fp8e4m3_unetonly.safetensors` unet only can use my [NVFP4 clips](https://huggingface.co/ApacheOne/sdxl_text_encoders-NVFP4) with taesd vae [https://github.com/madebyollin/taesd](https://github.com/madebyollin/taesd)

	[https://civitai.com/models/383364?modelVersionId=505350](https://civitai.com/models/383364?modelVersionId=505350)

	`creapromptLightning_creapromptHyperCFGV2_r32_r0.1_HSWQ_fp8e4m3.safetensors` full model

	`creapromptLightning_creapromptHyperCFGV2_r32_r0.1_HSWQ_fp8e4m3_unetonly.safetensors` unet only can use my [NVFP4 clips](https://huggingface.co/ApacheOne/sdxl_text_encoders-NVFP4) with taesd vae [https://github.com/madebyollin/taesd](https://github.com/madebyollin/taesd)

	# Hybrid-Sensitivity-Weighted-Quantization (HSWQ)

	<p align="center">
	<img src="https://raw.githubusercontent.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/main/icon.png" width="128">
	</p>

	High-fidelity FP8 quantization for diffusion models (SDXL). HSWQ uses sensitivity and importance analysis instead of naive uniform cast, and offers two modes: standard-compatible (V1) and high-performance scaled (V2).

	Technical details: [md/HSWQ_ Hybrid Sensitivity Weighted Quantization.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/md/HSWQ_%20Hybrid%20Sensitivity%20Weighted%20Quantization.md)

	How to quantize: [md/HSWQ_ How to quantize SDXL.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/md/How%20to%20quantize%20SDXL.md)

	SDXL Benchmark Test Results: [md/SDXL Benchmark Test Results.md](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization/blob/main/test/benchmark_test.md)





	# Credit & Special Acknowledgement

	[https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization](https://github.com/ussoewwin/Hybrid-Sensitivity-Weighted-Quantization)

	[https://github.com/tritant/ComfyUI_Kitchen_nvfp4_Converter](https://github.com/tritant/ComfyUI_Kitchen_nvfp4_Converter)

	[https://github.com/NVIDIA/Model-Optimizer](https://github.com/NVIDIA/Model-Optimizer)

	We extend our deepest respect and gratitude to the Nunchaku Team for their groundbreaking work on SVDQ quantization and for sharing their models with the community. This collection relies heavily on their research and original implementation.
	- Original Repository: [nunchaku-tech/nunchaku-sdxl](https://huggingface.co/nunchaku-tech/nunchaku-sdxl)