ApacheOne/Nucleus-Image-NVFP4_mixed

ApacheOne committed on Apr 16

Commit 7f91f3f · verified · 1 Parent(s): 141cb69

Update README.md

Files changed (1)

README.md +18 -3

@@ -1,3 +1,18 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+base_model:
+- NucleusAI/Nucleus-Image
+base_model_relation: quantized
+---
+I am not yet sure of the best way to run the base model. It should work if you offload the dense layers to RAM and leave only the active layers in VRAM.
+# UNTESTED
+I can't test this model myself, as it's too big.
+`Nucleus-Image_noreshape2Dweightonly.safetensors`: This model has only the 2D non-MoE-expert layers quantized to NVFP4, which should speed up the active layers in VRAM without any quantization loss in the dense layers.
+I will try testing this model once more loading methods for it are supported.
+`Nucleus-Image_transformer_aggressive_nvfp4.safetensors`: This model has every layer quantized, just to show what is possible. The dense layers might be sensitive to the quantization.
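
Since both files are marked untested, one cheap check that needs no GPU is to read only the safetensors header and count tensors per dtype and rank, to see which layers were actually touched. The following is a minimal sketch, not part of this commit: the function names `read_safetensors_header` and `summarize_dtypes` are hypothetical, and it assumes that quantized tensors are stored under a different dtype than the unquantized ones (the README does not say how the NVFP4 weights are laid out on disk).

```python
import json
import struct
from collections import Counter


def read_safetensors_header(path):
    """Return the tensor entries of a .safetensors header without reading tensor data."""
    with open(path, "rb") as f:
        # The file starts with a little-endian unsigned 64-bit header length,
        # followed by that many bytes of UTF-8 JSON.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form string map, not a tensor entry.
    header.pop("__metadata__", None)
    return header


def summarize_dtypes(header):
    """Count tensors per (dtype, number of dimensions)."""
    return Counter((info["dtype"], len(info["shape"])) for info in header.values())
```

Calling `summarize_dtypes(read_safetensors_header("Nucleus-Image_noreshape2Dweightonly.safetensors"))` on a local copy should, under the assumption above, show the 2D weights split between a quantized dtype and the original one, whereas the aggressive file would show few or no 2D tensors left in the original dtype.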