FLUX.2-dev INT8 W8A8 ConvRot Quant

INT8 W8A8 ConvRot quantization of black-forest-labs/FLUX.2-dev, packaged for use with ComfyUI-INT8-Fast.

It is a quantized version of the original FLUX.2-dev weights intended to reduce VRAM use and improve inference speed in supported ComfyUI workflows. Roughly 2x faster inference on INT8 supported cards.

Model Details

  • Base model: black-forest-labs/FLUX.2-dev
  • Quantization: INT8 W8A8
  • Rotation method: ConvRot
  • Target runtime: ComfyUI with ComfyUI-INT8-Fast
  • Model type: Rectified flow transformer image generation / editing model
  • License: FLUX Non-Commercial License, inherited from FLUX.2-dev

Intended Use

Use this checkpoint in ComfyUI through the ComfyUI-INT8-Fast custom node. Also tested working with Flux2TurboComfyv2 low step lora from here with the "pre_lora" loader.

How to Use

  1. Install ComfyUI.
  2. Install triton and ComfyKitchen (this model was tested and working with cuda128)
  3. Install the custom node:
  4. Download this checkpoint from the Hugging Face repository.
  5. Download text-encoder and vae from here.
  6. Place the model files in the text_encoders, vae and diffusion_models subfolders expected by your ComfyUI setup.
  7. Load it with the INT8 model loader node from ComfyUI-INT8-Fast.

Refer to the custom node repository for current installation requirements and workflow examples.

License and Use Restrictions

This quantization is derived from black-forest-labs/FLUX.2-dev and follows the same FLUX Non-Commercial License terms. Also see 'here' for the license that BobJohnson24/ComfyUI-INT8-Fast falls under.

Users are responsible for complying with the original FLUX.2-dev license, acceptable use policy, and any additional restrictions from Black Forest Labs, ComfyUI and ComfyUI-INT8-Fast.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for solphor/Flux2-Dev-INT8-W8A8-Convrot-Model

Quantized
(12)
this model