There are two models - FP8 and NVFP4Mixed.

- FP8 (2.4GB): (***recommended***) maximizes generation speed while preserving quality as much as possible.
- NVFP4Mixed (2.0GB): (***marginal quality***) a mixture of FP8 and NVFP4.

To use `torch.compile`, use the `TorchCompileModelAdvanced` node from KJNodes, set the mode to `max-autotune-no-cudagraphs`, and make sure `dynamic` is set to `false`.
## Generation speed

Tested on

- RTX 5090 (400W), ComfyUI with the `--fast` option, torch 2.10.0+cu130
- Generates 832x1216, 30 steps, cfg 4.0, er sde, simple

| quant | none | sage+torch.compile |
|----------|------|--------------------|
| nvfp4mix | 6.37s/4.71it/s (+12%) | 4.99s/6.01it/s (+43%) |
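
As a sanity check, the total time and it/s columns are consistent with each other: the per-image time is just the 30-step count divided by the reported iteration rate.

```python
STEPS = 30  # step count from the test settings above

def seconds_per_image(it_per_s: float) -> float:
    # total sampling time = steps / iterations-per-second
    return round(STEPS / it_per_s, 2)

# nvfp4mix row from the table above
print(seconds_per_image(4.71))  # 6.37 s without sage/torch.compile
print(seconds_per_image(6.01))  # 4.99 s with sage + torch.compile
```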
## Sample
| quant | sample |
|------------|----------------------|
| **bf16** |  |
| **fp8** |  |
| **nvfp4mixed** |  |
## Quantized layers