| --- |
| license: other |
| license_name: ideogram-4-non-commercial |
| license_link: https://huggingface.co/ideogram-ai/ideogram-4-fp8/blob/main/LICENSE.md |
| pipeline_tag: text-to-image |
| tags: |
| - text-to-image |
| - image-generation |
| - diffusion |
| - flow-matching |
| - dit |
| - ideogram |
| --- |
| |
| This is an INT8 quantization of Ideogram4. Double quantization was unavoidable because of the state in which the model was released. |
|
|
| The FP8 weights were dequantized to FP32 using their FP8 scales, downcast to BF16, and then quantized to INT8. |
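
As a rough illustration of that conversion chain, here is a minimal pure-Python sketch. It is not the script used to produce these weights. It assumes the FP8 values have already been decoded to plain floats, and it assumes symmetric per-row absmax INT8 quantization, which may differ from the scheme ComfyUI-INT8-Fast actually uses. The names `to_bf16`, `quantize_row_int8`, and `requantize` are hypothetical.

```python
import struct


def to_bf16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even).

    NaN/inf handling is omitted; this is only an illustration.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Add a rounding bias, then drop the low 16 bits of the FP32 pattern.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def quantize_row_int8(row):
    """Symmetric absmax INT8 quantization of one row (assumed scheme)."""
    absmax = max(abs(v) for v in row)
    scale = absmax / 127.0 if absmax > 0 else 1.0
    q = [max(-127, min(127, round(v / scale))) for v in row]
    return q, scale


def requantize(fp8_values, fp8_scale):
    """FP8 (already decoded to floats) -> FP32 -> BF16 -> INT8."""
    fp32 = [v * fp8_scale for v in fp8_values]  # apply the FP8 scale
    bf16 = [to_bf16(v) for v in fp32]           # downcast to BF16
    return quantize_row_int8(bf16)              # quantize to INT8


q, scale = requantize([1.0, -0.5, 0.25, 0.0], fp8_scale=2.0)
print(q, scale)
```

Each stage rounds the values again, which is why the result is a double quantization: the INT8 codes approximate BF16 values that were themselves derived from already-quantized FP8 weights.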
|
|
| For use in ComfyUI with https://github.com/BobJohnson24/ComfyUI-INT8-Fast |
|
|
| On my 3090, without torch compile, INT8 runs at 2.03 s/it versus 3.62 s/it for FP8, which is about 1.78x faster. |
|
|
| <s>~2x faster with torch compile.</s> |
|
|
| On further inspection, torch compile appears to cause quality issues with this model. |
|
|
| Quick comparison: |
|
|
| <img src="Comparison.jpg" width="1000" height="500"> |
|
|
| <img src="Comparison2.jpg" width="1000" height="500"> |