Instructions to use moonshotai/Kimi-K2-Instruct-0905 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use moonshotai/Kimi-K2-Instruct-0905 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="moonshotai/Kimi-K2-Instruct-0905", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("moonshotai/Kimi-K2-Instruct-0905", trust_remote_code=True, device_map="auto")

Inference
HuggingChat
Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use moonshotai/Kimi-K2-Instruct-0905 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "moonshotai/Kimi-K2-Instruct-0905"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "moonshotai/Kimi-K2-Instruct-0905",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/moonshotai/Kimi-K2-Instruct-0905

SGLang

How to use moonshotai/Kimi-K2-Instruct-0905 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "moonshotai/Kimi-K2-Instruct-0905" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "moonshotai/Kimi-K2-Instruct-0905",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "moonshotai/Kimi-K2-Instruct-0905" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "moonshotai/Kimi-K2-Instruct-0905",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use moonshotai/Kimi-K2-Instruct-0905 with Docker Model Runner:
```
docker model run hf.co/moonshotai/Kimi-K2-Instruct-0905
```

Kimi-K2-Instruct-0905

Commit History

Transformers v5 support (#21)

ac6c49f
verified

courage17340

hmellor HF Staff commited on Jan 30

fix-vocab-size (#17)

12d9c7c
verified

bigmoyan commited on Nov 7, 2025

not add functions. in tool id (#15)

94a4053
verified

bigmoyan commited on Oct 22, 2025

remove-auto-add-str-functions (#14)

975af05
verified

bigmoyan commited on Oct 22, 2025

fix apply_chat_template (#13)

46ec167
verified

bigmoyan commited on Oct 22, 2025

update_chat_template_and_tokenizer (#12)

09d5f93
verified

bigmoyan commited on Oct 10, 2025

Update README.md

7152993
verified

jerryzhu423 commited on Sep 5, 2025

Update README.md

2ce3235
verified

lsw825 commited on Sep 5, 2025

Update README.md

e4f9671
verified

xxr3376 commited on Sep 5, 2025

Update README.md

65191cf
verified

jerryzhu423 commited on Sep 5, 2025

Update README.md

d30fdf6
verified

bigmoyan commited on Sep 4, 2025

add FAQ for tool calls.

d56abb2
verified

bigmoyan commited on Sep 4, 2025

update readme

13adc1d

liushaowei commited on Sep 3, 2025

Add files using upload-large-folder tool

b1ad81b
verified

lsw825 commited on Sep 3, 2025

Add files using upload-large-folder tool

eb1e816
verified

lsw825 commited on Sep 3, 2025

Upload config.json with huggingface_hub

440d282
verified

lsw825 commited on Sep 3, 2025

initial commit

86c87f9
verified

lsw825 commited on Sep 3, 2025

Commit History

Transformers v5 support (#21) ac6c49f verified

fix-vocab-size (#17) 12d9c7c verified

not add functions. in tool id (#15) 94a4053 verified

remove-auto-add-str-functions (#14) 975af05 verified

fix apply_chat_template (#13) 46ec167 verified

update_chat_template_and_tokenizer (#12) 09d5f93 verified

Update README.md 7152993 verified

Update README.md 2ce3235 verified

Update README.md e4f9671 verified

Update README.md 65191cf verified

Update README.md d30fdf6 verified

add FAQ for tool calls. d56abb2 verified

update readme 13adc1d

Add files using upload-large-folder tool b1ad81b verified

Add files using upload-large-folder tool eb1e816 verified

Upload config.json with huggingface_hub 440d282 verified

initial commit 86c87f9 verified

Transformers v5 support (#21)

ac6c49f
verified

fix-vocab-size (#17)

12d9c7c
verified

not add functions. in tool id (#15)

94a4053
verified

remove-auto-add-str-functions (#14)

975af05
verified

fix apply_chat_template (#13)

46ec167
verified

update_chat_template_and_tokenizer (#12)

09d5f93
verified

Update README.md

7152993
verified

Update README.md

2ce3235
verified

Update README.md

e4f9671
verified

Update README.md

65191cf
verified

Update README.md

d30fdf6
verified

add FAQ for tool calls.

d56abb2
verified

update readme

13adc1d

Add files using upload-large-folder tool

b1ad81b
verified

Add files using upload-large-folder tool

eb1e816
verified

Upload config.json with huggingface_hub

440d282
verified

initial commit

86c87f9
verified