Instructions to use unsloth/DeepSeek-R1-GGUF-UD with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use unsloth/DeepSeek-R1-GGUF-UD with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained("unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) - llama-cpp-python
How to use unsloth/DeepSeek-R1-GGUF-UD with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="unsloth/DeepSeek-R1-GGUF-UD", filename="BF16/DeepSeek-R1-BF16-00001-of-00030.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use unsloth/DeepSeek-R1-GGUF-UD with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: ./llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: ./build/bin/llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Use Docker
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- LM Studio
- Jan
- vLLM
How to use unsloth/DeepSeek-R1-GGUF-UD with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "unsloth/DeepSeek-R1-GGUF-UD" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- SGLang
How to use unsloth/DeepSeek-R1-GGUF-UD with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "unsloth/DeepSeek-R1-GGUF-UD" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "unsloth/DeepSeek-R1-GGUF-UD" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Ollama
How to use unsloth/DeepSeek-R1-GGUF-UD with Ollama:
ollama run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- Unsloth Studio
How to use unsloth/DeepSeek-R1-GGUF-UD with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
- Docker Model Runner
How to use unsloth/DeepSeek-R1-GGUF-UD with Docker Model Runner:
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- Lemonade
How to use unsloth/DeepSeek-R1-GGUF-UD with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Run and chat with the model
lemonade run user.DeepSeek-R1-GGUF-UD-UD-Q4_K_XL
List all available models
lemonade list
Add files using upload-large-folder tool
Browse files- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00001-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00002-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00003-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00004-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00001-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00002-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00003-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00004-of-00005.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00001-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00002-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00003-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00004-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00005-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00006-of-00006.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00001-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00002-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00003-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00004-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00005-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00006-of-00007.gguf +2 -2
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00001-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:91978a1fb57393b1657e4b648f2269074df262b93c650633beea89fd87fb17de
|
| 3 |
+
size 49207689088
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00002-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5bb89bfe940be2aad688872a63ed8761dac0950ecb8268af7b4de87821d5b03a
|
| 3 |
+
size 49336582784
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00003-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bb254a57838317ebd81c4e90fcd85c08423b6cc531e3de5a95a3652e71d6b18a
|
| 3 |
+
size 49910431488
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00004-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b2d78eb97c5979640964380169673db3297de37821d95f617bec350f49176da9
|
| 3 |
+
size 49629275392
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00001-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f639ec7e9861df65e27a1797f6e8308d5d55694e53eaad4f239b16045aeadf74
|
| 3 |
+
size 49309593856
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00002-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3349adca0a33634f7077709956bd4b2124d661b3ab6595ed5f3bf99e6d29cebf
|
| 3 |
+
size 49996888192
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00003-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:da3bf996f1dbc67bf193cb4977d3b48b0e2cb715474125a870c0ec8afcbb92c9
|
| 3 |
+
size 49541021152
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00004-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ea75d9a908f9dd9506e62f6ea31320bdd91d5f60f73cc2e2560d5b8a49c78ddf
|
| 3 |
+
size 49334400128
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00001-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6fb1306ea7cfe3cf7fb76938d17591bc72ed2827d318865bdc41faa90cd066df
|
| 3 |
+
size 48762082880
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00002-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:48d9062dab099e92b9430f5db1cb9fbf7d8bd91e5044c749cc555f434e3017ca
|
| 3 |
+
size 48831904704
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00003-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ce7981bb29bbb506f15f6f69f02f6d823e30315ebff6a271f47ae354b5f0828f
|
| 3 |
+
size 49223320896
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00004-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:be83058a2da8d72caffb863b7fa2547600fe27ff13eff034adb548dcc9c499fc
|
| 3 |
+
size 49435945152
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00005-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7aef668161b72696727a701c8239cb78a7b9d56c18d1ff96a1b30bc9f1f73a49
|
| 3 |
+
size 49298068672
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00006-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0dc3acb25e7318d622827053ffd3a6ddea3718cd8ba343e47d2b10d4865b3342
|
| 3 |
+
size 27046540256
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00001-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:046beb5fbf1537c347233b74301b3095ddd222a049c69b6179fbe66984bb53f9
|
| 3 |
+
size 48277423296
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00002-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d67223ee8689c387b254b4379dcd51c6aca0b33c56bcd7aa0008bfbf7e8780d
|
| 3 |
+
size 48678887232
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00003-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bb1fb5bbd3b561c799fa2c3428de23f07049aea667dfc97131aa15844c8f4783
|
| 3 |
+
size 48786554144
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00004-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:24a53fd71fca7a6b05ba89570173974213425818d4de9caa685de969a7f2baf6
|
| 3 |
+
size 48811011488
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00005-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:45262014866dfb270869c39ee8d3cb9da5dcf16d464d7f945fea05a171c41af6
|
| 3 |
+
size 49972030496
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00006-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:49813c5ff2a53f4c266b5b30b29a0ad0ea381808b86710dbbdf4c6fff40b6a7a
|
| 3 |
+
size 49791741248
|