Instructions to use unsloth/DeepSeek-R1-GGUF-UD with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use unsloth/DeepSeek-R1-GGUF-UD with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained("unsloth/DeepSeek-R1-GGUF-UD", trust_remote_code=True) - llama-cpp-python
How to use unsloth/DeepSeek-R1-GGUF-UD with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="unsloth/DeepSeek-R1-GGUF-UD", filename="BF16/DeepSeek-R1-BF16-00001-of-00030.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use unsloth/DeepSeek-R1-GGUF-UD with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: ./llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL # Run inference directly in the terminal: ./build/bin/llama-cli -hf unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Use Docker
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- LM Studio
- Jan
- vLLM
How to use unsloth/DeepSeek-R1-GGUF-UD with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "unsloth/DeepSeek-R1-GGUF-UD" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- SGLang
How to use unsloth/DeepSeek-R1-GGUF-UD with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "unsloth/DeepSeek-R1-GGUF-UD" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "unsloth/DeepSeek-R1-GGUF-UD" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/DeepSeek-R1-GGUF-UD", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Ollama
How to use unsloth/DeepSeek-R1-GGUF-UD with Ollama:
ollama run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- Unsloth Studio
How to use unsloth/DeepSeek-R1-GGUF-UD with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for unsloth/DeepSeek-R1-GGUF-UD to start chatting
- Docker Model Runner
How to use unsloth/DeepSeek-R1-GGUF-UD with Docker Model Runner:
docker model run hf.co/unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
- Lemonade
How to use unsloth/DeepSeek-R1-GGUF-UD with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull unsloth/DeepSeek-R1-GGUF-UD:UD-Q4_K_XL
Run and chat with the model
lemonade run user.DeepSeek-R1-GGUF-UD-UD-Q4_K_XL
List all available models
lemonade list
Add files using upload-large-folder tool
Browse files- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00001-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00002-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00003-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00004-of-00005.gguf +2 -2
- UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00005-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00001-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00002-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00003-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00004-of-00005.gguf +2 -2
- UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00005-of-00005.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00001-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00002-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00003-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00004-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00005-of-00006.gguf +2 -2
- UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00006-of-00006.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00001-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00002-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00003-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00004-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00005-of-00007.gguf +2 -2
- UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00006-of-00007.gguf +2 -2
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00001-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:784411690e89a5778d15a391aa244388532289a2676d00d22a70216b46737b7d
|
| 3 |
+
size 49981619360
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00002-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:90b1240389b36746f00ca13b43b94241940470eaa2dc7fd405f6f4c424d4400d
|
| 3 |
+
size 48961656704
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00003-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8e4dbea7f7b8c78e5ff9925caeb7ac150542d1fa616171cf8646df3421660b66
|
| 3 |
+
size 49327575552
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00004-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:63010c5c569d30c928c7f8cb6b8812b44ba8adc2319e9a48359198f326402ad5
|
| 3 |
+
size 48484681600
|
UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00005-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a1d93862e97967190fb69f78b30e5fba2d19525ba9aa317a4cd1a8ebc5b43b73
|
| 3 |
+
size 3970183840
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00001-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6e44219da9b349adfb4a7a5f4bd844244d212f9b6d6e50354b3af5037c2b1ce8
|
| 3 |
+
size 49887316736
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00002-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:650e6f6abd1739421b63764d5d628e4b967a5a243a807c2331b1098e9464c209
|
| 3 |
+
size 49987913856
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00003-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f6e9da9560b3f5ec2a881d56a810944d0a7ddb1bbd746ad06c2ce4289c265358
|
| 3 |
+
size 49489479648
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00004-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0b9613dff5a58cb84940e6185c251928b0f9863dfae8d5dafa4deffe64f23bfa
|
| 3 |
+
size 49256204416
|
UD-IQ2_XXS/DeepSeek-R1-UD-IQ2_XXS-00005-of-00005.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:44298ffaa500087d4e046d7218d158f65a2a4ab23fda93ffa24e7e0120291598
|
| 3 |
+
size 17513817184
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00001-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9c582844f536d0d7f28ee12585c307790adc3ffef3f99a6f1034b21393ab0fb4
|
| 3 |
+
size 48700095040
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00002-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a9d4fc11d0d1c654e973ac520719d94d22da70376c4c5bb5d33e405050372e6c
|
| 3 |
+
size 48824948160
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00003-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b91753a9c30d58a3481d9e5395e5a2f000ae104818878cbd05b2aace51f7e6a3
|
| 3 |
+
size 48521059872
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00004-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d6a6e0e3bc93122a347c9e750d230c4743c0c8633978f2682812de3ee4baf62
|
| 3 |
+
size 49553819328
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00005-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:46f71e41230785c475394d2f7e98924d7a6e6bd382e48583df283d6c7c66a979
|
| 3 |
+
size 49215260352
|
UD-IQ3_XXS/DeepSeek-R1-UD-IQ3_XXS-00006-of-00006.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b99ff99fbdce61e55419a14486c1a5f3c7fb4c68ddbdc72915a9ac23d8b20b63
|
| 3 |
+
size 28501386464
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00001-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0ff216ea0e97852f5b10b22ae61b78fa47eeb5bb902d5142eadffc6633c127d1
|
| 3 |
+
size 48232198336
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00002-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fcbb3fc3a184839f0bc94d3aaa0f8048a4f9f1632c71be01eff633bc17341fa1
|
| 3 |
+
size 48670371648
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00003-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e9e48983d6ecf0b5a858db867251042a52a3c6b12db075ceed4461ee90611713
|
| 3 |
+
size 49275684128
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00004-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ec1921409c38198d675e18fb72eb8e5a782fb1a58774f3022d8d076023a39fdd
|
| 3 |
+
size 48812905888
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00005-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4544b0cb3c31f18c35893677ed6a9c875f02f9794fa3433d5a470b15801cb685
|
| 3 |
+
size 49921291808
|
UD-Q3_K_XL/DeepSeek-R1-UD-Q3_K_XL-00006-of-00007.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0c96ce047d6fb2f1f872de1cf97d6d66ae590f0ab459fb9010ee69fdaa27e3d1
|
| 3 |
+
size 49726336832
|