bombman/Nucleus-Image-FP8-Native

NucleusMoEImagePipeline


bombman committed on Apr 19

Commit a11dbf4 · verified · 1 Parent(s): a798bb0

Update README.md

Files changed (1)

README.md +17 -3


@@ -1,3 +1,17 @@
----
-license: creativeml-openrail-m
----

+---
+license: creativeml-openrail-m
+base_model: nucleus-image
+tags:
+- text-to-image
+- fp8
+---
+# Nucleus-Image-FP8-Native (Heavyweight Edition)
+This is a native FP8 (float8_e4m3fn) quantization of the 17B Nucleus-Image model.
+## ⚠️ VRAM Warning
+This model is **EXTREMELY HEAVY**. Even in FP8, the weights alone take up ~13-14GB of VRAM.
+- **16GB VRAM (RTX 4060 Ti / 4070 Ti Super / 4080):** Use `sequential_cpu_offload` for stability. Pure-GPU inference may run out of memory (OOM) at 1024x1024.
+- **24GB VRAM (RTX 3090 / 4090):** Best experience. Can run entirely on the GPU without offloading.
+- **12GB VRAM or less:** **NOT RECOMMENDED** unless using heavy CPU offloading (will be slow).
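
The VRAM tiers above can be sketched as a small selection helper. This is an illustrative sketch, not part of the commit: the function name `recommend_mode` and the returned labels are hypothetical, and the README does not say what to do between 12GB and 16GB, so the code assumes anything below 16GB falls into the "not recommended" tier.

```python
def recommend_mode(vram_gb: float) -> str:
    """Map available VRAM (in GB) to the mode suggested by the README tiers.

    Hypothetical helper: the cutoffs at 24 and 16 come from the README;
    treating everything below 16 GB as "not_recommended" is an assumption.
    """
    if vram_gb >= 24:
        # 24GB tier (e.g. RTX 3090 / 4090): run fully on the GPU.
        return "gpu"
    if vram_gb >= 16:
        # 16GB tier: offload to CPU for stability at 1024x1024.
        return "sequential_cpu_offload"
    # 12GB or less (and, by assumption, anything under 16GB).
    return "not_recommended"


if __name__ == "__main__":
    for gb in (12, 16, 24):
        print(f"{gb} GB -> {recommend_mode(gb)}")
```

If the model is loaded through a diffusers `DiffusionPipeline` (an assumption; the commit does not show loading code), the `sequential_cpu_offload` case would typically correspond to calling `pipe.enable_sequential_cpu_offload()` instead of moving the pipeline to the GPU.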