Spaces:
Running on Zero
Create a Space for quantizing models
Download and Upload
Needs code for downloading the model file from a Hugging Face repo using huggingface_hub
Needs code for uploading the quantized model to a target repo as a pull request using huggingface_hub
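A minimal sketch of the two transfer steps, assuming the public huggingface_hub functions hf_hub_download and HfApi.upload_file (with create_pr=True so the upload opens a pull request instead of committing directly). The function names download_model and upload_as_pull_request, the commit message, and the token handling are illustrative assumptions, not part of the notes above.

```python
from typing import Optional


def download_model(source_repo: str, filename: str, token: Optional[str] = None) -> str:
    """Download one file from a Hub repo and return its local cache path."""
    # Imported lazily so this module loads even where huggingface_hub is absent.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=source_repo, filename=filename, token=token)


def upload_as_pull_request(
    local_path: str,
    target_repo: str,
    target_filename: str,
    token: Optional[str] = None,
) -> Optional[str]:
    """Upload a local file to the target repo as a pull request.

    Returns the pull request URL reported by the Hub, if any.
    """
    from huggingface_hub import HfApi

    api = HfApi(token=token)
    commit = api.upload_file(
        path_or_fileobj=local_path,
        path_in_repo=target_filename,
        repo_id=target_repo,
        repo_type="model",  # assumption: the target is a model repo
        commit_message=f"Add quantized model {target_filename}",
        create_pr=True,  # open a pull request rather than pushing to main
    )
    return commit.pr_url
```

In the Space, the source repo and filename from the UI would feed download_model, and the target repo and filename would feed upload_as_pull_request once quantization finishes.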
Quantization options in UI
Source repo and filename for input model
Target repo and filename for output model
Quant format options in UI
int8 rowwise (add -int8mixedrow-simple to output model name): int8=True scaling_mode="row"
mxfp8 (add -mxfp8mixed-simple to output model name): mxfp8=True
fp8 (default; add -fp8mixed-simple to output model name): scaling_mode="tensor"
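The format list above could be held in one lookup table that drives both the UI choices and the output name. This is a sketch only: the names QUANT_FORMATS, DEFAULT_FORMAT, and output_filename are hypothetical, and placing the suffix before the file extension is an assumption about what "add to output model name" means.

```python
from pathlib import PurePosixPath

# One entry per UI choice: the filename suffix and the options it turns on.
QUANT_FORMATS = {
    "fp8": {
        "suffix": "-fp8mixed-simple",
        "options": {"scaling_mode": "tensor"},
    },
    "int8 rowwise": {
        "suffix": "-int8mixedrow-simple",
        "options": {"int8": True, "scaling_mode": "row"},
    },
    "mxfp8": {
        "suffix": "-mxfp8mixed-simple",
        "options": {"mxfp8": True},
    },
}
DEFAULT_FORMAT = "fp8"


def output_filename(source_filename: str, quant_format: str = DEFAULT_FORMAT) -> str:
    """Insert the format suffix between the file stem and its extension."""
    path = PurePosixPath(source_filename)
    suffix = QUANT_FORMATS[quant_format]["suffix"]
    return str(path.with_name(f"{path.stem}{suffix}{path.suffix}"))
```

For example, output_filename("model.safetensors", "mxfp8") gives "model-mxfp8mixed-simple.safetensors", which could prefill the target filename field in the UI.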
Layer filter options in UI
Anima: anima=True
Microsoft Lens: lens=True
Flux2: flux2=True
Chroma: distillation_large=True
Radiance: nerf_large=True radiance=True
WAN: wan=True
LTX-2.x: ltxv2=True
Qwen Image (should add high-precision matmul option): qwen=True full_precision_matrix_mult=True
Z-Image: zimage=True zimage_refiner=True
Regular expression (string value should be a free-text input): exclude-layers="(substring_1|substring_2|substring_3)"
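The layer filter choices can be sketched the same way, as a table from UI label to the options listed above, plus a small check on the free-text regular expression before it is passed on. LAYER_FILTERS and exclude_layers_pattern are hypothetical names, and how the exclude-layers value reaches the quantizer is not specified in these notes, so the helper only validates the pattern and returns it.

```python
import re

# UI label -> options enabled by that layer filter (values taken from the list above).
LAYER_FILTERS = {
    "Anima": {"anima": True},
    "Microsoft Lens": {"lens": True},
    "Flux2": {"flux2": True},
    "Chroma": {"distillation_large": True},
    "Radiance": {"nerf_large": True, "radiance": True},
    "WAN": {"wan": True},
    "LTX-2.x": {"ltxv2": True},
    "Qwen Image": {"qwen": True, "full_precision_matrix_mult": True},
    "Z-Image": {"zimage": True, "zimage_refiner": True},
}


def exclude_layers_pattern(text: str) -> str:
    """Check that the free-text input compiles as a regex and return it stripped.

    Raises ValueError for empty or invalid input so the UI can show the message.
    """
    pattern = text.strip()
    if not pattern:
        raise ValueError("The exclude-layers pattern is empty.")
    try:
        re.compile(pattern)
    except re.error as exc:
        raise ValueError(f"Invalid regular expression: {exc}") from exc
    return pattern
```

Validating in the UI handler means a typo in the pattern is reported before any download or quantization work starts.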
Always included
comfy_quant=True save_quant_metadata=True low_memory=True simple=True calib_samples=40960
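One way to combine the fixed options with the user's selections is a plain dictionary merge in which the always-included values are applied last, so no UI choice can override them. ALWAYS_INCLUDED and build_options are hypothetical names, and the sketch assumes the quantizer accepts these options as keyword-style settings, which these notes do not confirm.

```python
from typing import Any, Dict, Optional

# Options that every run uses, regardless of the UI selections.
ALWAYS_INCLUDED: Dict[str, Any] = {
    "comfy_quant": True,
    "save_quant_metadata": True,
    "low_memory": True,
    "simple": True,
    "calib_samples": 40960,
}


def build_options(
    format_options: Dict[str, Any],
    filter_options: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Merge format and layer filter options, then apply the fixed options last."""
    return {**format_options, **(filter_options or {}), **ALWAYS_INCLUDED}
```

For example, build_options({"mxfp8": True}, {"wan": True}) yields the mxfp8 and WAN flags together with the five fixed settings.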