cpatonn commited on
Commit
0c20986
·
verified ·
1 Parent(s): bf7e04c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -1
README.md CHANGED
@@ -3,11 +3,64 @@ license: other
3
  license_name: iquestcoder
4
  license_link: https://huggingface.co/IQuestLab/IQuest-Coder-V1-40B-Instruct/blob/main/LICENSE
5
  language:
6
- - en
7
  library_name: transformers
8
  base_model: IQuestLab/IQuest-Coder-V1-40B-Instruct
9
  ---
10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
 
12
  ![Evaluation Results](./papers/iquest-coder-v1-logo.png)
13
 
 
3
  license_name: iquestcoder
4
  license_link: https://huggingface.co/IQuestLab/IQuest-Coder-V1-40B-Instruct/blob/main/LICENSE
5
  language:
6
+ - en
7
  library_name: transformers
8
  base_model: IQuestLab/IQuest-Coder-V1-40B-Instruct
9
  ---
10
 
11
+ # IQuest-Coder-V1-40B-Instruct AWQ - INT8
12
+
13
+ ## Model Details
14
+
15
+ ### Quantization Details
16
+
17
+ - **Quantization Method:** AWQ
18
+ - **Bits:** 8
19
+ - **Group Size:** 32
20
+ - **Calibration Dataset:** [princeton-nlp/SWE-bench_Verified](https://huggingface.co/datasets/princeton-nlp/SWE-bench_Verified)
21
+ - **Quantization Tool:** [llm-compressor](https://github.com/vllm-project/llm-compressor)
22
+
23
+ ### Memory Usage
24
+
25
+ | **Type** | **IQuest-Coder-V1-40B-Instruct** | **IQuest-Coder-V1-40B-Instruct-AWQ-8bit** |
26
+ |:---------------:|:----------------:|:----------------:|
27
+ | **Memory Size** | 74.1 GB | 40.1 GB |
28
+ | **KV Cache per Token** | 320.0 kB | 160.0 kB |
29
+ | **KV Cache per Context** | 40.0 GB | 20.0 GB |
30
+
31
+ ## Inference
32
+
33
+ ### Prerequisite
34
+
35
+
36
+ ```bash
37
+ git clone https://github.com/vllm-project/vllm.git
38
+ cd vllm
39
+ git fetch origin pull/31575/head:pr-31575
40
+ git checkout pr-31575
41
+ VLLM_USE_PRECOMPILED=1 pip install -e .
42
+ ```
43
+
44
+
45
+ ### Basic Usage
46
+
47
+
48
+ ```bash
49
+ vllm serve cyankiwi/IQuest-Coder-V1-40B-Instruct-AWQ-8bit --trust-remote-code
50
+ ```
51
+
52
+
53
+ ## Additional Information
54
+
55
+ ### Changelog
56
+
57
+ - **v1.0.0** - Initial quantized release
58
+
59
+ ### Authors
60
+
61
+ - **Name:** Ton Cao
62
+ - **Contacts:** ton@cyan.kiwi
63
+
64
 
65
  ![Evaluation Results](./papers/iquest-coder-v1-logo.png)
66