zRzRzRzRzRzRzR committed on
Commit
120edc2
·
1 Parent(s): f6142f1

update with Ascend support

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -66,13 +66,13 @@ We're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It
 
  ## Serve GLM-5.2 Locally
 
- The following open-source frameworks support local deployment of GLM-5.2:
+ GLM-5.2 supports deployment with the following frameworks. Feel free to try them out:
 
  - [SGLang](https://github.com/sgl-project/sglang) (v0.5.13.post1+) — see [cookbook](https://cookbook.sglang.io/autoregressive/GLM/GLM-5.2)
- - [vLLM](https://github.com/vllm-project/vllm) (v0.23.0+) — see [recipes](https://github.com/vllm-project/recipes/blob/main/GLM/GLM5.md)
- - [xLLM](https://github.com/jd-opensource/xllm) (v0.10.0+) — see [example](https://github.com/zai-org/GLM-5/blob/main/example/ascend.md)
+ - [vLLM](https://github.com/vllm-project/vllm) (v0.23.0+) — see [recipes](https://recipes.vllm.ai/zai-org/GLM-5.2)
  - [Transformers](https://github.com/huggingface/transformers) (v0.5.12+) — see [transformers docs](https://github.com/huggingface/transformers/blob/main/docs/source/en/model_doc/glm_moe_dsa.md)
  - [KTransformers](https://github.com/kvcache-ai/ktransformers) (v0.5.12+) — see [tutorial](https://github.com/kvcache-ai/ktransformers/blob/main/doc/en/kt-kernel/GLM-5.2-Tutorial.md)
+ - For deployment on the `Ascend NPU` platform, inference frameworks such as vLLM-Ascend, xLLM and SGLang are supported — see [here](github.com/zai-org/GLM-5/blob/main/example/ascend.md).
 
  ## Citation