Instructions to use Bedovyy/Anima-FP8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusion Single File
How to use Bedovyy/Anima-FP8 with Diffusion Single File:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
FP8/MXFP8 Quantized model of ANIMA
To use torch.compile on FP8/MXFP8 models, set to max-autotune-no-cudagraphs mode and dynamic to false.
Generation speed
Tested on
- RTX5090 (400W), ComfyUI(commit id
c96fcdd) with--fastoption, torch2.12.0+cu130 - Generates 832x1216, 30steps, cfg 5.0, er_sde, simple
| quant | sage+torch.compile |
|---|---|
| bf16 | 5.03s, 6.15it/s |
| fp8 | 4.52s, 6.88it/s |
| mxfp8 | 4.71s, 6.58it/s |
Sample
anima-base-v1.0
anima-preview3-base
anima-preview2
anima-preview
Quantized layers
quantized by comfy-dit-quantizer
fp8
{
"format": "comfy_quant",
"block_names": ["net.blocks."],
"rules": [
{ "policy": "keep", "match": ["blocks.0.", "blocks.1.", "blocks.27.", "adaln_modulation"] },
{ "policy": "float8_e4m3fn", "match": ["q_proj", "k_proj", "v_proj", "output_proj", ".mlp"] },
{ "policy": "nvfp4", "match": [] }
]
}
nvfp4mixed
{
"format": "comfy_quant",
"block_names": ["net.blocks."],
"rules": [
{ "policy": "keep", "match": ["blocks.0.", "blocks.1.", "blocks.27.", "adaln_modulation"] },
{ "policy": "mxfp8", "match": ["q_proj", "k_proj", "v_proj", "output_proj", ".mlp"] },
{ "policy": "nvfp4", "match": [] }
]
}
- Downloads last month
- 1,171






