Instructions for using deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct with libraries and local apps.

Libraries

How to use deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
# Print the result so it is visible when run as a script
print(pipe(messages))

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
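
The final line above decodes only the tokens that follow the prompt. For a decoder-only model, `generate` returns the prompt tokens followed by the newly generated ones, so slicing at the prompt length leaves just the reply. A minimal sketch of that slicing, using plain Python lists as stand-ins for tensors (the token IDs and the `strip_prompt` helper are hypothetical, for illustration only):

```python
# Toy stand-ins for token ID sequences. Real code would use the tensors
# returned by the tokenizer and by model.generate.
prompt_ids = [101, 7, 8, 9]            # hypothetical prompt token IDs
generated = prompt_ids + [42, 43, 2]   # generate() output: prompt + new tokens


def strip_prompt(sequence, prompt_length):
    """Return only the tokens produced after the prompt."""
    return sequence[prompt_length:]


new_tokens = strip_prompt(generated, len(prompt_ids))
print(new_tokens)
```

The same idea is what `outputs[0][inputs["input_ids"].shape[-1]:]` expresses on tensors: `shape[-1]` is the prompt length in tokens.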

Local Apps

vLLM

How to use deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
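
The curl call above can also be issued from Python. The following is a sketch using only the standard library; it assumes the server from the previous step is listening on `localhost:8000`, and the helper names `build_chat_request` and `send` are hypothetical, not part of vLLM.

```python
import json
import urllib.request

MODEL = "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct"


def build_chat_request(prompt, base_url="http://localhost:8000", model=MODEL):
    """Build the same POST request that the curl command sends."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the parsed JSON body (needs a running server)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


request = build_chat_request("What is the capital of France?")
print(request.full_url)
# With the server running, call it with: print(send(request))
```

Building the request separately from sending it makes the payload easy to inspect without a live server.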

Use Docker

docker model run hf.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

SGLang

How to use deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
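
Because the endpoint is OpenAI-compatible, the reply text is expected under `choices[0].message.content` in the JSON response. A minimal sketch of extracting it follows; the sample body is made up for illustration and the `extract_reply` helper is hypothetical, so check the fields against what your server actually returns.

```python
import json

# Hypothetical response body in the OpenAI chat-completions shape.
# The text and field values are invented for illustration.
sample_body = json.dumps({
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "Paris."},
            "finish_reason": "stop",
        }
    ]
})


def extract_reply(body):
    """Return the assistant text from the first choice, or None if absent."""
    data = json.loads(body)
    choices = data.get("choices") or []
    if not choices:
        return None
    return choices[0].get("message", {}).get("content")


print(extract_reply(sample_body))
```

Returning `None` for a body without choices keeps error responses from raising a `KeyError` in the caller.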

Docker Model Runner
How to use deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct with Docker Model Runner:
```
docker model run hf.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
```

DeepSeek-Coder-V2-Lite-Instruct

Commit History

Update modeling_deepseek.py

432f2d1
verified

mukulp committed on Sep 18, 2024

Update README.md

e434a23
verified

guoday committed on Jul 3, 2024

Update modeling_deepseek.py

45d0aa4

mashirong committed on Jun 24, 2024

Update README.md

ec228ab
verified

guoday committed on Jun 19, 2024

Update README.md

21f7d9f
verified

guoday committed on Jun 18, 2024

Update README.md

b6ae3a9
verified

guoday committed on Jun 17, 2024

Update README.md

7389984
verified

guoday committed on Jun 17, 2024

Create README.md

c48c7cf
verified

guoday committed on Jun 16, 2024

Remove unused file

53268db

mashirong committed on Jun 14, 2024

Upload folder using huggingface_hub

34397f9
verified

msr2000 committed on Jun 14, 2024

initial commit

29cd052
verified

msr2000 committed on Jun 14, 2024
