wenhuach commited on
Commit
3dc616e
·
verified ·
1 Parent(s): 6cd3fc1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -25,7 +25,7 @@ pip install git+https://github.com/huggingface/transformers.git
25
  **MTP has not been supported, we will try to fix it later.**
26
 
27
  ~~~bash
28
- CUDA_VISIBLE_DEVICES=1,2,3,4 vllm serve /xuehaosu/glm5_whc/GLM-5-w4g128/ \
29
  --tensor-parallel-size 4 \
30
  --gpu-memory-utilization 0.85 \
31
  --tool-call-parser glm47 \
 
25
  **MTP has not been supported, we will try to fix it later.**
26
 
27
  ~~~bash
28
+ CUDA_VISIBLE_DEVICES=1,2,3,4 vllm serve Intel/GLM-5-int4-mixed-AutoRound \
29
  --tensor-parallel-size 4 \
30
  --gpu-memory-utilization 0.85 \
31
  --tool-call-parser glm47 \