File size: 2,626 Bytes
a56ee5e d6c1285 adc9307 a2dd0de 1ab42f6 a2dd0de ac7a7c4 e5b58cc ac7a7c4 8594a59 ac7a7c4 a37345a f15e42c | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 | ---
license: other
license_name: circlestone-labs-non-commercial-license
license_link: https://huggingface.co/circlestone-labs/Anima/blob/main/LICENSE.md
base_model:
- circlestone-labs/Anima
pipeline_tag: text-to-image
base_model_relation: quantized
---
# Anima-Base-FP8

This repository provides the **FP8 quantized version** of the [Anima-Base](https://huggingface.co/circlestone-labs/Anima) model.
It is optimized to significantly reduce VRAM usage while maintaining high-quality generation, making it much easier to run on consumer-grade GPUs with limited VRAM.
## Quantization Tool
This model was quantized using the following open-source tool:
* **Quantizer**: [comfy-dit-quantizer](https://github.com/bedovyy/comfy-dit-quantizer)
## Quantized Models
There are two models - FP8 and FP8-balanced
- FP8 (2.4GB) : (***recommend***) maximize generation speed while preserving quality as much as possible.
- FP8-balanced (2.7GB) : (***Personal Preference***) retain the prefix and suffix blocks intact, while exclusively modifying the Self-Attention and MLP layers. As a result, its performance is remarkably close to the original BF16 model.
| quant | sample |
|------------|----------------------|
| **bf16** ||
| **fp8** ||
| **fp8-balanced** ||
## Quantized layers
### fp8
```json
{
"format": "comfy_quant",
"block_names": ["net.blocks."],
"rules": [
{ "policy": "keep", "match": ["blocks.0", "blocks.1."] },
{ "policy": "float8_e4m3fn", "match": ["q_proj", "k_proj", "v_proj", "o_proj", "output_proj", ".mlp"] },
{ "policy": "nvfp4", "match": [] }
]
}
```
### fp8-balanced
```json
{
"format": "comfy_quant",
"block_names": ["net.blocks."],
"rules": [
{ "policy": "keep", "match": ["blocks.0.", "blocks.1.", "blocks.26.", "blocks.27."] },
{ "policy": "float8_e4m3fn", "match": ["self_attn.", ".mlp"] },
{ "policy": "nvfp4", "match": [] }
]
}
```
## Acknowledgments
* **Inspired by**: [Bedovyy](https://huggingface.co/Bedovyy) (This is his repo: [Anima-FP8](https://huggingface.co/Bedovyy/Anima-FP8)) |