maximg commited on
Commit
2b943bb
·
verified ·
1 Parent(s): 4b3163a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -0
README.md CHANGED
@@ -1,3 +1,39 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - multilingual
5
+ base_model: Qwen/Qwen3.6-27B
6
+ tags:
7
+ - auto-round
8
+ - intel
9
+ - gguf
10
+ - quantization
11
  ---
12
+
13
+ # Qwen3.6-27B GGUF (AutoRound Quantized, MTP Enabled)
14
+
15
+ This repository contains GGUF quantized versions of [Qwen/Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B) created using Intel's [AutoRound](https://github.com/intel/auto-round) quantization method.
16
+
17
+ ## Quantization Details
18
+
19
+ The models were generated using Intel's AutoRound with MTP layers explicitly enabled:
20
+
21
+ ```bash
22
+ auto-round \
23
+ --model Qwen/Qwen3.6-27B \
24
+ --output_dir ./quantized/ \
25
+ --scheme <SCHEME> \
26
+ --format <SCHEME> \
27
+ --iters 0 \
28
+ --nsamples 256 --seqlen 2850 --dataset "HuggingFaceH4/ultrachat_200k"
29
+ ```
30
+
31
+ For now, only 2 quantization variants were used Q5_K_M and Q4_K_MIXED. Q4_K_MIXED is a custom variant based on Intel's original Q2_K_MIXED quantization, but using Q4_K quants instead of Q2.
32
+
33
+
34
+ ### Files and Sizes
35
+
36
+ | File Name | Quant Type | Size |
37
+ |-----------|------------|------|
38
+ | `Qwen3.6-27B-Q2_K_MIXED.gguf` | Q2_K_MIXED | 12 GB |
39
+ | `Qwen3.6-27B-Q5_K_M.gguf` | Q5_K_M | 18 GB |