ApacheOne committed on
Commit 7f91f3f · verified
1 Parent(s): 141cb69

Update README.md

Files changed (1)
  1. README.md +18 -3
README.md CHANGED
@@ -1,3 +1,18 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ base_model:
+ - NucleusAI/Nucleus-Image
+ base_model_relation: quantized
+ ---
+
+ I am not yet sure of the best way to run the base model. It should work if you offload the dense layers to RAM and keep only the active layers in VRAM.
+
+ # UNTESTED
+
+ I can't test this model myself as it's too big.
+
+ `Nucleus-Image_noreshape2Dweightonly.safetensors`: This model has only the 2D non-MoE expert layers quantized to nvfp4, which should speed up the active layers in VRAM without quantization loss in the dense layers.
+
+ I will try testing this model once more support is available for loading it.
+
+ `Nucleus-Image_transformer_aggressive_nvfp4.safetensors`: This model has every layer quantized, just to show what is possible. The dense layers might be sensitive to the quantization.
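
Since both files are untested, one cheap way to see which tensors were quantized is to read only the safetensors header, which does not load any weights into RAM or VRAM. The sketch below is not part of this commit. It relies only on the published safetensors layout (an 8-byte little-endian header length followed by a JSON header). The helper names `read_safetensors_header` and `summarize_dtypes` are hypothetical, and how nvfp4 tensors are labeled in the `dtype` field of these particular files is not known here, so the code just reports whatever strings it finds.

```python
import json
import struct
from collections import Counter


def read_safetensors_header(path):
    """Return the tensor entries from a .safetensors header without reading tensor data.

    Hypothetical helper: parses the 8-byte little-endian length prefix,
    then the JSON header that follows it.
    """
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # "__metadata__" is an optional free-form entry, not a tensor.
    header.pop("__metadata__", None)
    return header


def summarize_dtypes(header):
    """Count tensors per (dtype string, number of dimensions).

    The dtype strings are taken verbatim from the file; no assumption is
    made about how nvfp4 weights are labeled.
    """
    return Counter(
        (info["dtype"], len(info["shape"])) for info in header.values()
    )
```

For example, `summarize_dtypes(read_safetensors_header("Nucleus-Image_noreshape2Dweightonly.safetensors"))` would give a per-dtype count that can be compared against the same summary for `Nucleus-Image_transformer_aggressive_nvfp4.safetensors`.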