ApacheOne/Nucleus-Image-NVFP4_mixed

ApacheOne committed on Apr 17

Commit 84fd786 · verified · 1 Parent(s): 7a6267b

Update README.md

Files changed (1)

README.md +9 -0

@@ -5,6 +5,15 @@ base_model:
 base_model_relation: quantized
 ---
+# UPDATE
+I am still getting OOM on an L4 24GB, 64GB RAM system.
+The aggressive quant is most likely a failure and will not work as is.
+The 2D-weights-only quant does load and run, but it still hits OOM because of an improper loading method.
 Not sure of the best way to even run the base model yet. It should work if you offload the dense layers into RAM and leave only the active layers in VRAM.
 # UNTESTED
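
The offloading idea in the README can be sketched as a small placement planner. This is only an illustration, not the loading method for this model: `plan_placement`, the module names, and the sizes are all hypothetical, and the README itself does not say how to load the checkpoint. The sketch keeps modules flagged as active in VRAM while they fit in a byte budget and sends everything else to system RAM.

```python
from typing import Dict, List, Tuple

GIB = 1024 ** 3


def plan_placement(
    modules: List[Tuple[str, int, bool]],
    vram_budget_bytes: int,
) -> Dict[str, str]:
    """Map each module name to "cuda:0" or "cpu".

    modules: (name, size_in_bytes, is_active) tuples. All names and sizes
    are hypothetical placeholders, not values read from a real checkpoint.
    """
    placement: Dict[str, str] = {}
    remaining = vram_budget_bytes
    for name, size, is_active in modules:
        # Only active modules are candidates for VRAM, and only if they fit.
        if is_active and size <= remaining:
            placement[name] = "cuda:0"
            remaining -= size
        else:
            # Inactive (dense) modules and anything that does not fit stay in RAM.
            placement[name] = "cpu"
    return placement


if __name__ == "__main__":
    # Hypothetical sizes; leave part of a 24 GiB card free for activations.
    example = [
        ("attention", 6 * GIB, True),
        ("dense_blocks", 30 * GIB, False),
        ("decoder", 20 * GIB, True),
    ]
    for name, device in plan_placement(example, 20 * GIB).items():
        print(name, "->", device)
```

The budget is deliberately set below the card's full capacity, since weights that exactly fill VRAM leave no room for activations and can still end in OOM at inference time.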