File size: 1,381 Bytes
497be85
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ed1945e
497be85
 
 
 
 
10c26b1
28d0e22
 
 
d6caed4
 
 
9c4d0d2
d6caed4
 
 
 
28d0e22
 
 
 
d6caed4
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
base_model: 0xSero/qwen3-coder-next-56b-REAP
base_model_relation: quantized
language:
- en
license: apache-2.0
tags:
- quantized
- gguf
- qwen3
- moe
---

# Qwen3-Coder-Next 56B REAP - GGUF

Quantized GGUF versions of [0xSero/qwen3-coder-next-56b-REAP](https://huggingface.co/0xSero/qwen3-coder-next-56b-REAP).
These were generated using the default settings with `llama-quantize` (b8740).

## Quantizations provided

| File | Quantization | Size |
|------|-------------|------|
| `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 34.4 GB |
| `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 40.3 GB |
| `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 46.5 GB |
| `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 60.2 GB |

## Perplexity test

I tested perplexity using `llama-perplexity` and Salesforce's [wikitext-2-raw-v1](https://huggingface.co/datasets/Salesforce/wikitext/tree/main/wikitext-2-raw-v1).

| File | Quantization | Ctx | PPL |
|------|--------------|-----|-----|
| `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 512 | 15.3702 +/- 0.13301 |
| `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 512 | 15.2810 +/- 0.13196 |
| `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 512 | 15.1305 +/- 0.13011 |
| `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 512 | 15.1198 +/- 0.13009 |
| `qwen3-coder-next-56b-REAP-BF16.gguf` | BF16 | 512 | 15.1274 +/- 0.13022 |