tclf90 commited on
Commit
4e7ab3e
·
verified ·
1 Parent(s): e3ac9c8

new installation instuction for stable/correct performance

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -17,17 +17,18 @@ base_model_relation: quantized
17
  Base model: [DeepSeek-V3.1](https://huggingface.co/deepseek-ai/DeepSeek-V3.1)
18
 
19
  ### 【Dependencies / Installation】
20
- As of **2025-08-23**, create a fresh Python environment and run:
21
 
22
  ```bash
23
  # ❗there are glitches with vllm 0.10.1.1, still looking for resolutions❗
24
  # ❗downgrade vllm for now ❗
25
- pip install vllm==0.9.0
26
- pip install transformers==4.53
27
 
28
- # ❗patch up AWQ MoE quant config, otherwise some modules cannot be properly loaded❗
29
  SITE_PACKAGES=$(pip -V | awk '{print $4}' | sed 's/\/pip$//')
 
30
  cp awq_marlin.py "$SITE_PACKAGES/vllm/model_executor/layers/quantization/awq_marlin.py"
 
 
31
  ```
32
 
33
  ### 【vLLM Single Node with 8 GPUs — Startup Command】
@@ -51,6 +52,12 @@ vllm serve \
51
 
52
  ### 【Logs】
53
  ```
 
 
 
 
 
 
54
  2025-08-23
55
  1. Initial commit
56
  ```
 
17
  Base model: [DeepSeek-V3.1](https://huggingface.co/deepseek-ai/DeepSeek-V3.1)
18
 
19
  ### 【Dependencies / Installation】
20
+ As of **2025-08-27**, create a fresh Python environment and run:
21
 
22
  ```bash
23
  # ❗there are glitches with vllm 0.10.1.1, still looking for resolutions❗
24
  # ❗downgrade vllm for now ❗
25
+ pip install vllm==0.9.2 transformers==4.53.0
 
26
 
 
27
  SITE_PACKAGES=$(pip -V | awk '{print $4}' | sed 's/\/pip$//')
28
+ # ❗patch up AWQ MoE quant config, otherwise some modules cannot be properly loaded❗
29
  cp awq_marlin.py "$SITE_PACKAGES/vllm/model_executor/layers/quantization/awq_marlin.py"
30
+ # ❗patch up for fp32 e_score_correction_bias, see https://www.github.com/vllm-project/vllm/pull/23640❗
31
+ cp deepseek_v2.py "$SITE_PACKAGES/vllm/model_executor/models/deepseek_v2.py"
32
  ```
33
 
34
  ### 【vLLM Single Node with 8 GPUs — Startup Command】
 
52
 
53
  ### 【Logs】
54
  ```
55
+ 2025-08-27
56
+ 1. new installation instuction for stable/correct performance
57
+ (a) use vllm 0.9.2 instead of 0.9.0:
58
+ there is unidentified issue with 0.9.0 🥹 which causes numerical error
59
+ (b) patch up deepseek_v2.py for fp32 e_score_correction_bias
60
+
61
  2025-08-23
62
  1. Initial commit
63
  ```