Commit f81e2e5 by maximg · verified · 1 parent: 2b943bb

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -16,7 +16,7 @@ This repository contains GGUF quantized versions of [Qwen/Qwen3.6-27B](https://h
 
  ## Quantization Details
 
- The models were generated using Intel's AutoRound with MTP layers explicitly enabled:
+ The models were generated using Intel's AutoRound using ultrachat_200k as the test dataset and using sequence length of 2850. MTP layers were not explicitly enabled, but it works with MTP for me
 
  ```bash
  auto-round \
@@ -35,5 +35,5 @@ For now, only 2 quantization variants were used Q5_K_M and Q4_K_MIXED. Q4_K_MIXE
 
  | File Name | Quant Type | Size |
  |-----------|------------|------|
- | `Qwen3.6-27B-Q2_K_MIXED.gguf` | Q2_K_MIXED | 12 GB |
- | `Qwen3.6-27B-Q5_K_M.gguf` | Q5_K_M | 18 GB |
+ | `Qwen3.6-27B-Q2_K_MIXED.gguf` | Q2_K_MIXED | 16.2 GB |
+ | `Qwen3.6-27B-Q5_K_M.gguf` | Q5_K_M | 18.5 GB |