PsiPi
/

cwm-Q2_K-GGUF

Model card Files Files and versions

PsiPi commited on Sep 28, 2025

Commit

be86c5c

·

verified ·

1 Parent(s): 17de383

Update README.md

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -20,6 +20,13 @@ Refer to the [original model card](https://huggingface.co/facebook/cwm) for more
 - Layer Offload **64**
 - Context Length **~50k**
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 - Layer Offload **64**
 - Context Length **~50k**
+## Fitting on 24gb in LMStudio @Q4_0
+- Flash attention **ENABLED**
+- K Cache Quant type **Q4_0**
+- V Cache Quant type **Q4_0**
+- Layer Offload **64**
+- Context Length **131072**
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)