CodeFault commited on
Commit
33c8fbf
·
verified ·
1 Parent(s): c014f82

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -29,7 +29,7 @@ These were generated using the default settings with `llama-quantize` (b9482).
29
  | `Mellum2-12B-A2.5B-Instruct-Q4_K_M.gguf` | Q4_K_M | 8.07 GB |
30
  | `Mellum2-12B-A2.5B-Instruct-Q5_K_M.gguf` | Q5_K_M | 9.21 GB |
31
  | `Mellum2-12B-A2.5B-Instruct-Q6_K.gguf` | Q6_K | 10.9 GB |
32
- | `Mellum2-12B-A2.5B-Instruct-Q8_0.gguf` | Q8_K | 12.9 GB |
33
 
34
  <small>1: Q4_0 is not recommended. Perplexity increased significantly which suggests degredated quality. I did not encounter endlessly repeating tokens like with the thinking variation at Q4_0.</small>
35
 
 
29
  | `Mellum2-12B-A2.5B-Instruct-Q4_K_M.gguf` | Q4_K_M | 8.07 GB |
30
  | `Mellum2-12B-A2.5B-Instruct-Q5_K_M.gguf` | Q5_K_M | 9.21 GB |
31
  | `Mellum2-12B-A2.5B-Instruct-Q6_K.gguf` | Q6_K | 10.9 GB |
32
+ | `Mellum2-12B-A2.5B-Instruct-Q8_0.gguf` | Q8_0 | 12.9 GB |
33
 
34
  <small>1: Q4_0 is not recommended. Perplexity increased significantly which suggests degredated quality. I did not encounter endlessly repeating tokens like with the thinking variation at Q4_0.</small>
35