File size: 1,272 Bytes
cd58174
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
# Create space for quantizing models

## Download and Upload

Needs code for downloading model file from a repo on hugginface using huggingface_hub



Needs code for uploading quantized model to a target repo as a pull request using huggingface_hub

## Quantization options in UI

Source repo and filename for input model

Target repo and filename for output model


### Quant format options in UI

int8 rowwise(add -int8mixedrow-simple to output model name):
int8=True
scaling_mode="row"



mxfp8(add -mxfp8mixed-simple to output model name):

mxfp8=True



fp8(default and add -fp8mixed-simple to output model name):

scaling_mode="tensor"

## Layer filter options in UI

Anima:
anima=True

Microsoft Lens:
lens=True

Flux2:
flux2=True

Chroma:
distillation_large=True



Radiance:

nerf_large=True
radiance=True

WAN:
wan=True

LTX-2.x:
ltxv2=True

Qwen Image(should add high precision matmul option):
qwen=True
full_precision_matrix_mult=True



Z-Image:

zimage=True

zimage_refiner=True

Regular expression(String value should be free text input):
exclude-layers="(substring_1|substring_2|substring_3)"



## Always included



comfy_quant=True
save_quant_metadata=True
low_memory=True

simple=True

calib_samples=40960