Spaces:

silveroxides
/

Quick-Quantize

Running on Zero

File size: 1,272 Bytes

cd58174

# Create space for quantizing models

## Download and Upload

Needs code for downloading model file from a repo on hugginface using huggingface_hub



Needs code for uploading quantized model to a target repo as a pull request using huggingface_hub

## Quantization options in UI

Source repo and filename for input model

Target repo and filename for output model


### Quant format options in UI

int8 rowwise(add -int8mixedrow-simple to output model name):
int8=True
scaling_mode="row"



mxfp8(add -mxfp8mixed-simple to output model name):

mxfp8=True



fp8(default and add -fp8mixed-simple to output model name):

scaling_mode="tensor"

## Layer filter options in UI

Anima:
anima=True

Microsoft Lens:
lens=True

Flux2:
flux2=True

Chroma:
distillation_large=True



Radiance:

nerf_large=True
radiance=True

WAN:
wan=True

LTX-2.x:
ltxv2=True

Qwen Image(should add high precision matmul option):
qwen=True
full_precision_matrix_mult=True



Z-Image:

zimage=True

zimage_refiner=True

Regular expression(String value should be free text input):
exclude-layers="(substring_1|substring_2|substring_3)"



## Always included



comfy_quant=True
save_quant_metadata=True
low_memory=True

simple=True

calib_samples=40960