tclf90 commited on
Commit
b4f3c57
·
verified ·
1 Parent(s): 318fee0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -6,11 +6,11 @@ tags:
6
  - GPTQ
7
  - vLLM
8
  base_model:
9
- - ZhipuAI/GLM-4.6
10
  base_model_relation: quantized
11
  ---
12
  # GLM-4.6-GPTQ-Int4-Int8Mix
13
- Base Model: [ZhipuAI/GLM-4.6](https://www.modelscope.cn/models/ZhipuAI/GLM-4.6)
14
 
15
  ### 【Dependencies / Installation】
16
  As of **2025-10-01**, create a fresh Python environment and run:
@@ -26,7 +26,7 @@ otherwise the expert tensors couldn’t be evenly sharded across GPU devices.</i
26
  ```
27
  CONTEXT_LENGTH=32768
28
  vllm serve \
29
- tclf90/GLM-4.6-GPTQ-Int4-Int8Mix \
30
  --served-model-name My_Model \
31
  --enable-auto-tool-choice \
32
  --tool-call-parser glm45 \
@@ -57,7 +57,7 @@ vllm serve \
57
  ### 【Model Download】
58
  ```python
59
  from modelscope import snapshot_download
60
- snapshot_download('tclf90/GLM-4.6-GPTQ-Int4-Int8Mix', cache_dir="your_local_path")
61
  ```
62
 
63
  ### 【Overview】
 
6
  - GPTQ
7
  - vLLM
8
  base_model:
9
+ - zai-org/GLM-4.6
10
  base_model_relation: quantized
11
  ---
12
  # GLM-4.6-GPTQ-Int4-Int8Mix
13
+ Base Model: [zai-org/GLM-4.6](https://huggingface.co/zai-org/GLM-4.6)
14
 
15
  ### 【Dependencies / Installation】
16
  As of **2025-10-01**, create a fresh Python environment and run:
 
26
  ```
27
  CONTEXT_LENGTH=32768
28
  vllm serve \
29
+ QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix \
30
  --served-model-name My_Model \
31
  --enable-auto-tool-choice \
32
  --tool-call-parser glm45 \
 
57
  ### 【Model Download】
58
  ```python
59
  from modelscope import snapshot_download
60
+ snapshot_download('QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix', cache_dir="your_local_path")
61
  ```
62
 
63
  ### 【Overview】