obsxrver's picture
Update README.md
d225df1 verified
|
Raw
History Blame Contribute Delete
1.02 kB
metadata
tags:
  - int8-convrot
  - comfyui

ComfyUI-Native-Int8-ConvRot

Int8 ConvRot models, converted to the format ComfyUI Expects.

Int8 ConvRot is the best quantization method so far in terms of Quality:Performance ratio. In my personal experience Int8 ConvRot models provide a similar level of quality to BF16 at a generation time matching or beating FP8_Scaled

'INT8 ConvRot is row-wise INT8 with parameters and activations rotated before quantization via ConvRot.' https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md

Quality Ranking:

GGUF Q8 > INT8 ConvRot > MXFP8 > FP8 >= INT8 Row > INT8 Tensorwise

References:

  1. https://www.reddit.com/r/StableDiffusion/comments/1uimp1j/so_is_int8convrot_the_new_hot_thing/
  2. https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md
  3. https://huggingface.co/Comfy-Org/Boogu-Image/discussions/10#6a404ed359b6d5b4e834a644
  4. https://github.com/Comfy-Org/ComfyUI/pull/14636
  5. https://huggingface.co/bertbobson/ComfyUI-INT8_ConvRot