---
base_model: 0xSero/qwen3-coder-next-56b-REAP
base_model_relation: quantized
language:
- en
license: apache-2.0
tags:
- quantized
- gguf
- qwen3
- moe
---
# Qwen3-Coder-Next 56B REAP - GGUF
Quantized GGUF versions of [0xSero/qwen3-coder-next-56b-REAP](https://huggingface.co/0xSero/qwen3-coder-next-56b-REAP).
Each file was generated with `llama-quantize` from llama.cpp build b8740, using its default settings.
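As a rough illustration of that step, the sketch below builds one `llama-quantize` command per target type, using the tool's positional `input output type` form. It assumes the source is the BF16 GGUF named in the perplexity table and that `llama-quantize` is on your `PATH`; the exact commands used for these files are not recorded here, so treat this as a reconstruction rather than the original script.

```python
import shlex

# Base file name shared by every GGUF in this repository.
BASE = "qwen3-coder-next-56b-REAP"
# Quantization types listed in the table of provided files.
QUANT_TYPES = ["Q4_K_M", "Q5_K_M", "Q6_K", "Q8_0"]


def quantize_commands(source=f"{BASE}-BF16.gguf"):
    """Return one llama-quantize argument list per target type.

    Assumption: the BF16 GGUF is the quantization source.
    """
    return [
        ["llama-quantize", source, f"{BASE}-{qtype}.gguf", qtype]
        for qtype in QUANT_TYPES
    ]


# Print the commands instead of running them, so they can be reviewed first.
for cmd in quantize_commands():
    print(shlex.join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` once the paths are confirmed.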
## Quantizations provided
| File | Quantization | Size |
|------|-------------|------|
| `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 34.4 GB |
| `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 40.3 GB |
| `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 46.5 GB |
| `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 60.2 GB |
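To put the file sizes in context, the sketch below converts them to an approximate bits-per-weight figure. It assumes the sizes are decimal gigabytes (10^9 bytes) and takes the nominal 56B parameter count from the model name; the true parameter count may differ, so the output is only a ballpark estimate.

```python
# File sizes in GB, copied from the table above.
SIZES_GB = {
    "Q4_K_M": 34.4,
    "Q5_K_M": 40.3,
    "Q6_K": 46.5,
    "Q8_0": 60.2,
}

# Assumption: nominal parameter count taken from the "56b" in the model name.
ASSUMED_PARAMS = 56e9


def bits_per_weight(size_gb, n_params=ASSUMED_PARAMS):
    """Approximate average bits per weight, treating 1 GB as 10**9 bytes."""
    return size_gb * 1e9 * 8 / n_params


for quant, size_gb in SIZES_GB.items():
    print(f"{quant}: {bits_per_weight(size_gb):.2f} bits/weight")
```

The estimate also counts file metadata and any tensors kept at higher precision, so it slightly overstates the per-weight cost.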
## Perplexity test
I measured perplexity with `llama-perplexity` on Salesforce's [wikitext-2-raw-v1](https://huggingface.co/datasets/Salesforce/wikitext/tree/main/wikitext-2-raw-v1) dataset. The BF16 row is included as a baseline for comparison.
| File | Quantization | Context (tokens) | Perplexity (PPL) |
|------|--------------|-----|-----|
| `qwen3-coder-next-56b-REAP-Q4_K_M.gguf` | Q4_K_M | 512 | 15.3702 +/- 0.13301 |
| `qwen3-coder-next-56b-REAP-Q5_K_M.gguf` | Q5_K_M | 512 | 15.2810 +/- 0.13196 |
| `qwen3-coder-next-56b-REAP-Q6_K.gguf` | Q6_K | 512 | 15.1305 +/- 0.13011 |
| `qwen3-coder-next-56b-REAP-Q8_0.gguf` | Q8_0 | 512 | 15.1198 +/- 0.13009 |
| `qwen3-coder-next-56b-REAP-BF16.gguf` | BF16 | 512 | 15.1274 +/- 0.13022 |
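One way to read these numbers is as a change relative to the BF16 baseline. The sketch below computes the absolute and percentage difference for each quantization from the values in the table; the helper name `compare_to_baseline` is only for illustration.

```python
# (perplexity, reported +/- error) copied from the table above.
RESULTS = {
    "Q4_K_M": (15.3702, 0.13301),
    "Q5_K_M": (15.2810, 0.13196),
    "Q6_K": (15.1305, 0.13011),
    "Q8_0": (15.1198, 0.13009),
    "BF16": (15.1274, 0.13022),
}


def compare_to_baseline(results, baseline="BF16"):
    """Return {name: (delta, percent_change)} relative to the baseline row."""
    base_ppl, _ = results[baseline]
    rows = {}
    for name, (ppl, _err) in results.items():
        if name == baseline:
            continue
        delta = ppl - base_ppl
        rows[name] = (delta, 100.0 * delta / base_ppl)
    return rows


for name, (delta, pct) in compare_to_baseline(RESULTS).items():
    print(f"{name}: {delta:+.4f} PPL ({pct:+.2f}% vs BF16)")
```

The Q6_K and Q8_0 differences from BF16 are far smaller than the reported error of about 0.13, so on this test they are not distinguishable from the baseline.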