CodeFault committed on
Commit d6caed4 · verified · 1 Parent(s): 497be85

Update README.md

  1. README.md +16 -1
README.md CHANGED
@@ -22,4 +22,19 @@ Quantized GGUF versions of [0xSero/qwen3-coder-next-56b-REAP](https://huggingfac
  | `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 60.2 GB |
  | `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 46.5 GB |
  | `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 40.3 GB |
- | `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 34.4 GB |
+ | `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 34.4 GB |
+
+ ## Perplexity test
+
+ I tested perplexity using `llama-perplexity` and Salesforce's [wikitext-2-raw-v1](https://huggingface.co/datasets/Salesforce/wikitext/tree/main/wikitext-2-raw-v1).
+
+ | File | Quantization | Ctx | PPL |
+ |------|--------------|-----|-----|
+ | `qwen3-coder-next-56b-REAP-BF16.gguf` | BF16 | 512 | 15.1274 +/- 0.13022 |
+ | `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 512 | 15.1198 +/- 0.13009 |
+ | `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 512 | 15.1305 +/- 0.13011 |
+ | `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 512 | 15.2810 +/- 0.13196 |
+ | `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 512 | 15.3702 +/- 0.13301 |
+
+
+
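To read the perplexity table added in this commit, it can help to express each quantization as a change relative to the BF16 row. The following is a minimal Python sketch, not part of the commit: the helper name `compare_to_baseline` is hypothetical, the numbers are copied from the table, and the two reported uncertainties are treated as independent. That is a simplifying assumption, since every run scores the same text, so the check is only a rough guide.

```python
import math

# (PPL, reported +/- uncertainty) copied from the perplexity table, ctx = 512.
RESULTS = {
    "BF16": (15.1274, 0.13022),
    "Q8_0": (15.1198, 0.13009),
    "Q6_K": (15.1305, 0.13011),
    "Q5_K_M": (15.2810, 0.13196),
    "Q4_K_M": (15.3702, 0.13301),
}


def compare_to_baseline(results, baseline="BF16"):
    """Return (name, delta, relative change in %, within_uncertainty) per row.

    Hypothetical helper for illustration only.
    """
    base_ppl, base_err = results[baseline]
    rows = []
    for name, (ppl, err) in results.items():
        delta = ppl - base_ppl
        rel_pct = 100.0 * delta / base_ppl
        # Assumption: combine the two uncertainties as if they were independent.
        combined_err = math.hypot(err, base_err)
        rows.append((name, delta, rel_pct, abs(delta) <= combined_err))
    return rows


if __name__ == "__main__":
    for name, delta, rel_pct, within in compare_to_baseline(RESULTS):
        print(f"{name:8s} delta={delta:+.4f} ({rel_pct:+.2f}%) "
              f"within combined uncertainty: {within}")
```

Under these assumptions, Q8_0 and Q6_K differ from BF16 by far less than the reported uncertainty, while Q4_K_M shows an increase of roughly 1.6%.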