comfyui

Please update the Boogu-Image-Turbo model

#10
by yeyingxian - opened

The Boogu Team release a hotfix Turbo model today. https://huggingface.co/Boogu/Boogu-Image-0.1-Turbo/tree/main
Please convert to ComfyUI's format

yeyingxian changed discussion status to closed
yeyingxian changed discussion status to open

nvfp4 and fp8 for boogu_image_turbo_hotfix, please! Thanks so much

fp8 hotfixes?

Comfy Org org

I won't be doing fp8 because int8-convrot is just much better, faster and better quality on all Nvidia GPUs. I've uploaded that now, but currently it needs very latest ComfyUI and comfy-kitchen versions.

I won't be doing fp8 because int8-convrot is just much better, faster and better quality on all Nvidia GPUs. I've uploaded that now, but currently it needs very latest ComfyUI and comfy-kitchen versions.

Is this a brand-new quantization format, and is FP8 about to be deprecated? Does this apply only to bboogu, or will it extend to future models as well?

Comfy Org org

I won't be doing fp8 because int8-convrot is just much better, faster and better quality on all Nvidia GPUs. I've uploaded that now, but currently it needs very latest ComfyUI and comfy-kitchen versions.

Is this a brand-new quantization format, and is FP8 about to be deprecated? Does this apply only to bboogu, or will it extend to future models as well?

Community has been using it for a while through Triton and custom nodes, it is now getting native support through cuda, and we'll monitor how well it performs with different models/GPUs before fully deciding that.

It does already look like fp8 is unnecessary for some models since int8-convrot is better quality and thus also allows quantizing more layers, ending up also faster on all Nvidia GPUs.

will int4 be natively support too? nunchaku int4 is as fast as nvfp4, and can work on 40s

Sign up or log in to comment