# Create space for quantizing models
## Download and Upload
Needs code for downloading the model file from a repo on Hugging Face using huggingface_hub
Needs code for uploading the quantized model to a target repo as a pull request using huggingface_hub
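
A minimal sketch of both steps, assuming the target is a model repo and that a write token is available (passed explicitly or picked up from the environment). The function names and the commit message wording are hypothetical; `hf_hub_download` and `HfApi.upload_file(..., create_pr=True)` are the huggingface_hub calls doing the work.

```python
from pathlib import Path
from typing import Optional


def commit_message(target_filename: str) -> str:
    """Hypothetical wording for the pull request title."""
    return f"Add quantized model {target_filename}"


def download_model(source_repo: str, filename: str) -> Path:
    """Fetch one file from a Hub repo and return its local cache path."""
    # Imported here so the helpers above can be used without the dependency.
    from huggingface_hub import hf_hub_download

    return Path(hf_hub_download(repo_id=source_repo, filename=filename))


def upload_as_pull_request(
    local_path: Path,
    target_repo: str,
    target_filename: str,
    token: Optional[str] = None,
):
    """Upload the quantized file and open a pull request on the target repo."""
    from huggingface_hub import HfApi

    api = HfApi(token=token)
    return api.upload_file(
        path_or_fileobj=str(local_path),
        path_in_repo=target_filename,
        repo_id=target_repo,
        repo_type="model",  # assumption: the target is a model repo
        commit_message=commit_message(target_filename),
        create_pr=True,  # open a PR instead of committing to the main branch
    )
```

Opening a pull request rather than committing directly means the Space does not need write access to the target repo's main branch.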
## Quantization options in UI
Source repo and filename for input model
Target repo and filename for output model
### Quant format options in UI
int8 rowwise (add -int8mixedrow-simple to output model name):
int8=True
scaling_mode="row"
mxfp8 (add -mxfp8mixed-simple to output model name):
mxfp8=True
fp8 (default; add -fp8mixed-simple to output model name):
scaling_mode="tensor"
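
The three formats above could be kept in one lookup table, sketched here. The table layout, the `output_filename` helper, and the choice to place the suffix before the file extension are assumptions; the option names and suffixes are the ones listed above.

```python
from pathlib import PurePosixPath

# UI label -> quantizer options and output-name suffix, as listed above.
QUANT_FORMATS = {
    "fp8": {
        "suffix": "-fp8mixed-simple",
        "options": {"scaling_mode": "tensor"},
    },
    "int8 rowwise": {
        "suffix": "-int8mixedrow-simple",
        "options": {"int8": True, "scaling_mode": "row"},
    },
    "mxfp8": {
        "suffix": "-mxfp8mixed-simple",
        "options": {"mxfp8": True},
    },
}
DEFAULT_FORMAT = "fp8"


def output_filename(source_filename: str, quant_format: str = DEFAULT_FORMAT) -> str:
    """Suggest an output name by inserting the format suffix before the extension."""
    path = PurePosixPath(source_filename)
    suffix = QUANT_FORMATS[quant_format]["suffix"]
    return str(path.with_name(path.stem + suffix + path.suffix))
```

The suggested name would only pre-fill the target filename field; the user can still overwrite it in the UI.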
## Layer filter options in UI
Anima:
anima=True
Microsoft Lens:
lens=True
Flux2:
flux2=True
Chroma:
distillation_large=True
Radiance:
nerf_large=True
radiance=True
WAN:
wan=True
LTX-2.x:
ltxv2=True
Qwen Image (should add a high precision matmul option):
qwen=True
full_precision_matrix_mult=True
Z-Image:
zimage=True
zimage_refiner=True
Regular expression (string value should be a free-text input):
exclude-layers="(substring_1|substring_2|substring_3)"
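
One way to turn the layer filter choices into options is sketched below. It assumes the high precision matmul setting is an optional checkbox that only applies to Qwen Image, and it keeps the `exclude-layers` key exactly as written above; how that key is finally passed to the quantizer is not specified here. The function and parameter names are hypothetical.

```python
import re
from typing import Optional

# UI label -> quantizer flags, as listed above.
LAYER_FILTERS = {
    "Anima": {"anima": True},
    "Microsoft Lens": {"lens": True},
    "Flux2": {"flux2": True},
    "Chroma": {"distillation_large": True},
    "Radiance": {"nerf_large": True, "radiance": True},
    "WAN": {"wan": True},
    "LTX-2.x": {"ltxv2": True},
    "Qwen Image": {"qwen": True},
    "Z-Image": {"zimage": True, "zimage_refiner": True},
}


def layer_filter_options(
    model_family: Optional[str] = None,
    high_precision_matmul: bool = False,
    exclude_regex: str = "",
) -> dict:
    """Collect the flags for the selected model family and optional regex."""
    options = dict(LAYER_FILTERS[model_family]) if model_family else {}
    # Assumption: the extra matmul option is only offered for Qwen Image.
    if model_family == "Qwen Image" and high_precision_matmul:
        options["full_precision_matrix_mult"] = True
    if exclude_regex:
        re.compile(exclude_regex)  # raises re.error if the pattern is invalid
        options["exclude-layers"] = exclude_regex
    return options
```

Compiling the pattern before the job starts surfaces a bad free-text entry in the UI instead of partway through quantization.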
## Always included
comfy_quant=True
save_quant_metadata=True
low_memory=True
simple=True
calib_samples=40960
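
A small sketch of how the fixed settings might be combined with the UI choices, assuming the quantizer accepts a flat dictionary of options. `build_options` and its parameters are hypothetical names; the fixed values are the ones listed above.

```python
# Settings applied to every run, regardless of the UI selection.
ALWAYS_INCLUDED = {
    "comfy_quant": True,
    "save_quant_metadata": True,
    "low_memory": True,
    "simple": True,
    "calib_samples": 40960,
}


def build_options(format_options: dict, filter_options: dict) -> dict:
    """Merge UI-selected options with the fixed settings.

    The fixed settings are merged last so a UI value can never override them.
    """
    return {**format_options, **filter_options, **ALWAYS_INCLUDED}
```

For example, `build_options({"scaling_mode": "tensor"}, {"wan": True})` yields the fp8 and WAN options plus the five fixed settings.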