Text-to-Image
Diffusion Single File
comfyui

FP8/MXFP8 Quantized model of ANIMA

To use torch.compile on FP8/MXFP8 models, set to max-autotune-no-cudagraphs mode and dynamic to false.

image

Generation speed

Tested on

  • RTX5090 (400W), ComfyUI(commit id c96fcdd) with --fast option, torch2.12.0+cu130
  • Generates 832x1216, 30steps, cfg 5.0, er_sde, simple
quant sage+torch.compile
bf16 5.03s, 6.15it/s
fp8 4.52s, 6.88it/s
mxfp8 4.71s, 6.58it/s

Sample

anima-base-v1.0

20260603_014607-Anima_00001_

anima-preview3-base

26-04-09-Anima_00005_

anima-preview2

26-03-12-Anima_00008_

anima-preview

quant sample
bf16 anima-preview-bf16
fp8 anima-preview-fp8
nvfp4mixed anima-preview-nvfp4

Quantized layers

quantized by comfy-dit-quantizer

fp8

{
  "format": "comfy_quant",
  "block_names": ["net.blocks."],
  "rules": [
    { "policy": "keep", "match": ["blocks.0.", "blocks.1.", "blocks.27.", "adaln_modulation"] },
    { "policy": "float8_e4m3fn", "match": ["q_proj", "k_proj", "v_proj", "output_proj", ".mlp"] },
    { "policy": "nvfp4", "match": [] }
  ]
}

nvfp4mixed

{
  "format": "comfy_quant",
  "block_names": ["net.blocks."],
  "rules": [
    { "policy": "keep", "match": ["blocks.0.", "blocks.1.", "blocks.27.", "adaln_modulation"] },
    { "policy": "mxfp8", "match": ["q_proj", "k_proj", "v_proj", "output_proj", ".mlp"] },
    { "policy": "nvfp4", "match": [] }
  ]
}
Downloads last month
1,171
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Bedovyy/Anima-FP8

Quantized
(24)
this model
Adapters
1 model