Wan2.2 β€” INT8 ConvRot Quantized

Wan2.2 INT8 ConvRot

INT8 tensor-wise quantized versions of the Wan2.2 14B diffusion models for ComfyUI, with ConvRot (group size 256) applied for maximum quantization accuracy on Ampere GPUs (RTX 30XX series).

Quantized with silveroxides/convert_to_quant directly from true BF16 source weights β€” single quantization pass, no double-quantization issues.


Download β€” INT8 ConvRot Models

Diffusion Models (this repo)

Text Encoder (this repo)

The following files are unchanged from the official Comfy-Org release β€” download them separately:

LoRA

VAE


File Storage Location

ComfyUI/
β”œβ”€β”€β”€πŸ“‚ models/
β”‚   β”œβ”€β”€β”€πŸ“‚ diffusion_models/
β”‚   β”‚   β”œβ”€β”€β”€ wan2.2_i2v_low_noise_14B_int8_convrot.safetensors   ← this repo
β”‚   β”‚   └─── wan2.2_i2v_high_noise_14B_int8_convrot.safetensors  ← this repo
β”‚   β”œβ”€β”€β”€πŸ“‚ loras/
β”‚   β”‚   β”œβ”€β”€β”€ wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
β”‚   β”‚   └─── wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
β”‚   β”œβ”€β”€β”€πŸ“‚ text_encoders/
β”‚   β”‚   └─── umt5_xxl_int8_convrot.safetensors  ← this repo
β”‚   β””β”€β”€β”€πŸ“‚ vae/
β”‚       └─── wan_2.1_vae.safetensors

Quantization Details

  • Format: INT8 tensor-wise (int8_tensorwise)
  • ConvRot group size: 256 β€” all Wan2.2 dimensions (256, 4096, 5120, 13824) divide cleanly into 256, so full-strength ConvRot is applied
  • Preset used: --wan (skips embeddings, encoders, head layers)
  • Source: true BF16 weights β€” single quantization pass, no FP8β†’INT8 double-quantization

Notes

  • Native INT8 support is available in recent ComfyUI builds (no extra custom nodes required for recent versions)
  • For older ComfyUI builds, install BobJohnson24/ComfyUI-INT8-Fast
  • These models are optimized for Ampere GPUs (RTX 30XX) β€” INT8 is significantly faster than FP8/FP4 on this generation

Disclaimer

Quantized versions of the original Wan2.2 models. All credit for the original models goes to their respective authors. Quantization may introduce minor differences in output quality compared to BF16/FP16 originals.

πŸ’¬ Discord: discord.gg/CJv5wceJaN β˜• Ko-fi: ko-fi.com/winnougan

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support