q4_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q4_K
Browse files
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
Yugo45-GPT-Quantized-GGUF-unsloth.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
Yugo45-GPT-Quantized-GGUF-unsloth.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
Yugo45-GPT-Quantized-GGUF.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
Yugo45-GPT-Quantized-GGUF-unsloth.Q4_K_M.gguf → Yugo45-GPT-Quantized-GGUF.Q4_K_M.gguf
RENAMED
|
File without changes
|