Space: silveroxides/Quick-Quantize

Create a Space for quantizing models

Needs code for downloading the model file from a repo on Hugging Face using huggingface_hub
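A minimal sketch of the download step, assuming `hf_hub_download` from huggingface_hub is the intended entry point. The function name `download_model`, the `local_dir` default, and the injectable `downloader` argument are hypothetical and exist only to keep the example testable offline.

```python
from typing import Callable, Optional


def download_model(
    repo_id: str,
    filename: str,
    local_dir: str = "input_models",  # hypothetical default directory
    downloader: Optional[Callable[..., str]] = None,
) -> str:
    """Fetch one file from a Hub repo and return its local path."""
    if downloader is None:
        # Imported lazily so the module still loads without huggingface_hub.
        from huggingface_hub import hf_hub_download

        downloader = hf_hub_download
    return downloader(repo_id=repo_id, filename=filename, local_dir=local_dir)
```

Called as `download_model(source_repo, source_filename)`, this would return the path of the downloaded file, which can then be handed to the quantizer.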

Needs code for uploading the quantized model to a target repo as a pull request using huggingface_hub
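One possible shape for the upload step, assuming `HfApi.upload_file` with `create_pr=True` is acceptable and that the target is a model repo. The function name `upload_as_pull_request`, the commit message, and the injectable `api` argument are hypothetical.

```python
from typing import Any, Optional


def upload_as_pull_request(
    local_path: str,
    target_repo: str,
    target_filename: str,
    api: Optional[Any] = None,
) -> Any:
    """Upload one file to the target repo and open a pull request for it."""
    if api is None:
        # Imported lazily; HfApi() picks up the locally configured token.
        from huggingface_hub import HfApi

        api = HfApi()
    return api.upload_file(
        path_or_fileobj=local_path,
        path_in_repo=target_filename,
        repo_id=target_repo,
        repo_type="model",  # assumption: the target is a model repo
        commit_message=f"Add quantized model {target_filename}",
        create_pr=True,  # open a PR instead of committing to the main branch
    )
```

With a real `HfApi`, the returned commit info should expose the pull request link as `pr_url`, which the Space could show to the user.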

Source repo and filename for input model

Target repo and filename for output model

int8 rowwise (add -int8mixedrow-simple to output model name): int8=True scaling_mode="row"

mxfp8 (add -mxfp8mixed-simple to output model name): mxfp8=True

fp8 (default; add -fp8mixed-simple to output model name): scaling_mode="tensor"
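The three quantization modes above could be captured in a lookup table like the following sketch. The flag names and suffixes come from these notes; the mode keys, the helper `output_filename`, and the choice to insert the suffix before the file extension are assumptions.

```python
from pathlib import PurePosixPath

# Mode key -> filename suffix and quantizer flags, as listed in the notes.
QUANT_MODES = {
    "fp8": {"suffix": "-fp8mixed-simple", "flags": {"scaling_mode": "tensor"}},
    "int8_rowwise": {
        "suffix": "-int8mixedrow-simple",
        "flags": {"int8": True, "scaling_mode": "row"},
    },
    "mxfp8": {"suffix": "-mxfp8mixed-simple", "flags": {"mxfp8": True}},
}
DEFAULT_MODE = "fp8"


def output_filename(source_filename: str, mode: str = DEFAULT_MODE) -> str:
    """Insert the mode suffix before the extension (assumed placement)."""
    path = PurePosixPath(source_filename)
    suffix = QUANT_MODES[mode]["suffix"]
    return str(path.with_name(f"{path.stem}{suffix}{path.suffix}"))


print(output_filename("model.safetensors"))
# model-fp8mixed-simple.safetensors
```

Keeping the suffix and the flags in one table means a new mode only needs one new entry.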

Anima: anima=True

Microsoft Lens: lens=True

Flux2: flux2=True

Chroma: distillation_large=True

Radiance: nerf_large=True radiance=True

WAN: wan=True

LTX-2.x: ltxv2=True

Qwen Image (should add a high-precision matmul option): qwen=True full_precision_matrix_mult=True

Z-Image: zimage=True zimage_refiner=True
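The model-type options above map naturally onto a preset table, for example behind a dropdown. This is a sketch only: the flag names are copied from the notes, while `MODEL_PRESETS` and `model_flags` are hypothetical names.

```python
# Display name -> quantizer flags, copied from the list in the notes.
MODEL_PRESETS = {
    "Anima": {"anima": True},
    "Microsoft Lens": {"lens": True},
    "Flux2": {"flux2": True},
    "Chroma": {"distillation_large": True},
    "Radiance": {"nerf_large": True, "radiance": True},
    "WAN": {"wan": True},
    "LTX-2.x": {"ltxv2": True},
    "Qwen Image": {"qwen": True, "full_precision_matrix_mult": True},
    "Z-Image": {"zimage": True, "zimage_refiner": True},
}


def model_flags(model_type: str) -> dict:
    """Return a copy of the flags for one model type."""
    try:
        return dict(MODEL_PRESETS[model_type])
    except KeyError:
        known = ", ".join(MODEL_PRESETS)
        raise ValueError(f"Unknown model type {model_type!r}; expected one of: {known}") from None
```

Returning a copy keeps callers from mutating the shared table when they add further options.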

Regular expression (string value should be a free-text input): exclude-layers="(substring_1|substring_2|substring_3)"
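Because the pattern arrives as free text, it may be worth validating it before starting a long quantization run. The sketch below assumes the quantizer treats the value as a regular expression searched anywhere in each layer name; the helper names and the sample layer names are hypothetical.

```python
import re
from typing import List, Optional, Pattern


def parse_exclude_layers(text: str) -> Optional[Pattern[str]]:
    """Compile the free-text field; an empty field means no exclusions."""
    text = text.strip()
    if not text:
        return None
    try:
        return re.compile(text)
    except re.error as exc:
        raise ValueError(f"Invalid exclude-layers pattern: {exc}") from exc


def excluded(layer_names: List[str], pattern: Optional[Pattern[str]]) -> List[str]:
    """List the layer names the pattern would exclude (substring search)."""
    if pattern is None:
        return []
    return [name for name in layer_names if pattern.search(name)]


# Hypothetical layer names, for illustration only.
layers = ["blocks.0.attn.qkv", "blocks.0.norm1", "pos_embed"]
print(excluded(layers, parse_exclude_layers("(norm|embed)")))
# ['blocks.0.norm1', 'pos_embed']
```

Showing the matched names back to the user before quantizing gives a quick check that the pattern does what was intended.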

comfy_quant=True save_quant_metadata=True low_memory=True simple=True calib_samples=40960
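The options on the last line read as settings that apply to every run. A sketch of combining them with a mode's flags and a model type's flags follows, assuming the quantizer accepts them as keyword-style options; `COMMON_FLAGS` and `build_flags` are hypothetical names.

```python
# Always-on options, copied from the notes.
COMMON_FLAGS = {
    "comfy_quant": True,
    "save_quant_metadata": True,
    "low_memory": True,
    "simple": True,
    "calib_samples": 40960,
}


def build_flags(mode_flags: dict, model_flags: dict) -> dict:
    """Merge common, mode, and model flags, refusing conflicting values."""
    flags = dict(COMMON_FLAGS)
    for extra in (mode_flags, model_flags):
        clashes = sorted(k for k in extra if k in flags and flags[k] != extra[k])
        if clashes:
            raise ValueError(f"Conflicting values for: {', '.join(clashes)}")
        flags.update(extra)
    return flags


# Example: int8 rowwise quantization of a WAN model.
print(build_flags({"int8": True, "scaling_mode": "row"}, {"wan": True}))
```

Raising on a conflict, rather than letting the later dictionary win, makes an accidental override visible instead of silently changing the run.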