Instructions to use zai-org/GLM-5 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zai-org/GLM-5 with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="zai-org/GLM-5")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
```

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("zai-org/GLM-5")
model = AutoModelForCausalLM.from_pretrained("zai-org/GLM-5")

messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
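If you want to watch the response as it is produced rather than waiting for the full output, the snippet below is a minimal sketch that reuses the `tokenizer`, `model`, and `inputs` defined above and streams tokens with the `TextStreamer` utility from Transformers. The sampling settings are illustrative, not recommended values for this model.

```python
# Minimal sketch (not from the model card): stream tokens as they are generated,
# reusing the tokenizer, model, and inputs built in the example above.
from transformers import TextStreamer

streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

outputs = model.generate(
    **inputs,
    max_new_tokens=128,
    do_sample=True,      # illustrative sampling settings, adjust as needed
    temperature=0.7,
    streamer=streamer,   # prints decoded tokens to stdout as they arrive
)
```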
- Inference
- HuggingChat
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use zai-org/GLM-5 with vLLM:
Install from pip and serve the model:
```shell
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "zai-org/GLM-5"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "zai-org/GLM-5",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
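Because the vLLM server speaks the OpenAI-compatible API, you can also call it from Python. The sketch below uses the official `openai` client (`pip install openai`) pointed at the local server started above; the dummy API key is only needed because the client requires one, unless you launched vLLM with an API key configured.

```python
# Minimal sketch: query the local vLLM server via its OpenAI-compatible API.
# Assumes the server from the example above is running on localhost:8000.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="zai-org/GLM-5",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(response.choices[0].message.content)
```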
Use Docker

```shell
docker model run hf.co/zai-org/GLM-5
```
- SGLang
How to use zai-org/GLM-5 with SGLang:
Install from pip and serve the model:
```shell
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "zai-org/GLM-5" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "zai-org/GLM-5",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
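As with vLLM, the SGLang server exposes an OpenAI-compatible endpoint, so plain HTTP calls from Python work as well. The sketch below assumes the server started above is reachable on localhost:30000 and uses the `requests` library.

```python
# Minimal sketch: call the SGLang server started above from Python.
import requests

resp = requests.post(
    "http://localhost:30000/v1/chat/completions",
    json={
        "model": "zai-org/GLM-5",
        "messages": [
            {"role": "user", "content": "What is the capital of France?"}
        ],
    },
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```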
Use Docker images

```shell
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "zai-org/GLM-5" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "zai-org/GLM-5",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```

- Docker Model Runner
How to use zai-org/GLM-5 with Docker Model Runner:
```shell
docker model run hf.co/zai-org/GLM-5
```
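`docker model run` pulls the model and opens an interactive chat prompt in the terminal. If you want to call it programmatically instead, Docker Model Runner can expose an OpenAI-compatible endpoint; the sketch below is based on assumptions not stated in this card (the host TCP port 12434 and the `/engines/v1` path), so check your Docker Model Runner configuration if the request fails.

```python
# Minimal sketch, under assumptions: Docker Model Runner's OpenAI-compatible
# API is reachable on the host. The port (12434) and the /engines/v1 path are
# assumptions; adjust them to match your Docker Model Runner setup.
import requests

resp = requests.post(
    "http://localhost:12434/engines/v1/chat/completions",
    json={
        "model": "hf.co/zai-org/GLM-5",
        "messages": [
            {"role": "user", "content": "What is the capital of France?"}
        ],
    },
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```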