Instructions to use tensorblock/DaringMaid-13B-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/DaringMaid-13B-GGUF with llama-cpp-python:
# !pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="tensorblock/DaringMaid-13B-GGUF",
    filename="DaringMaid-13B-Q2_K.gguf",
)
output = llm(
    "Once upon a time,",
    max_tokens=512,
    echo=True,
)
print(output)
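The call above prints the whole result object. As a minimal sketch, assuming the OpenAI-style completion dictionary that llama-cpp-python returns, the generated text can be pulled out as shown below. The helper name `completion_text` and the `sample` dictionary are hypothetical stand-ins for illustration, not real model output.

```python
def completion_text(output: dict) -> str:
    """Return the text of the first choice in a completion-style dict."""
    return output["choices"][0]["text"]


# Hand-written stand-in for the dict returned by llm(...); with echo=True
# the returned text starts with the prompt itself.
sample = {
    "choices": [
        {"text": "Once upon a time, (generated text)", "finish_reason": "length"}
    ]
}

print(completion_text(sample))
```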
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use tensorblock/DaringMaid-13B-GGUF with llama.cpp:
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
# Run inference directly in the terminal:
llama-cli -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
# Run inference directly in the terminal:
llama-cli -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
# Run inference directly in the terminal:
./llama-cli -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
# Run inference directly in the terminal:
./build/bin/llama-cli -hf tensorblock/DaringMaid-13B-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/DaringMaid-13B-GGUF:Q2_K
- LM Studio
- Jan
- vLLM
How to use tensorblock/DaringMaid-13B-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "tensorblock/DaringMaid-13B-GGUF"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "tensorblock/DaringMaid-13B-GGUF",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
Use Docker
docker model run hf.co/tensorblock/DaringMaid-13B-GGUF:Q2_K
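The curl call to the vLLM server can also be issued from Python. The following is a minimal sketch using only the standard library; it assumes the server from the `vllm serve` step is listening on `localhost:8000`, and the helper name `build_completion_request` is hypothetical. The request is only constructed here; the lines that send it are left commented out because they need a running server.

```python
import json
import urllib.request


def build_completion_request(base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build the same POST request as the curl example (not sent yet)."""
    payload = {
        "model": "tensorblock/DaringMaid-13B-GGUF",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5,
    }
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_completion_request()
print(request.get_method(), request.full_url)

# Sending the request requires the server to be running:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response)["choices"][0]["text"])
```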
- Ollama
How to use tensorblock/DaringMaid-13B-GGUF with Ollama:
ollama run hf.co/tensorblock/DaringMaid-13B-GGUF:Q2_K
- Unsloth Studio
How to use tensorblock/DaringMaid-13B-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for tensorblock/DaringMaid-13B-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for tensorblock/DaringMaid-13B-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for tensorblock/DaringMaid-13B-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/DaringMaid-13B-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/DaringMaid-13B-GGUF:Q2_K
- Lemonade
How to use tensorblock/DaringMaid-13B-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull tensorblock/DaringMaid-13B-GGUF:Q2_K
Run and chat with the model
lemonade run user.DaringMaid-13B-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
- DaringMaid-13B-Q3_K_L.gguf +0 -3
- DaringMaid-13B-Q3_K_M.gguf +0 -3
- DaringMaid-13B-Q3_K_S.gguf +0 -3
- DaringMaid-13B-Q4_0.gguf +0 -3
- DaringMaid-13B-Q4_K_M.gguf +0 -3
- DaringMaid-13B-Q4_K_S.gguf +0 -3
- DaringMaid-13B-Q5_0.gguf +0 -3
- DaringMaid-13B-Q5_K_M.gguf +0 -3
- DaringMaid-13B-Q5_K_S.gguf +0 -3
- DaringMaid-13B-Q6_K.gguf +0 -3
- DaringMaid-13B-Q8_0.gguf +0 -3
DaringMaid-13B-Q3_K_L.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:df08d2648f142dfd93af0738a1a7ffe5b4fd7fb66bfef18cbc17923e88b16b6e
-size 6929559904
DaringMaid-13B-Q3_K_M.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:36f9a6712c80ec993b3bc2f3cb4061399d0f6c984c03c66abf9adb5d1db4baf9
-size 6337769824
DaringMaid-13B-Q3_K_S.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:4be9b2133ea2c24a199107cfb25986fa8c2b5137191e6f16bac0fc9d07e45b67
-size 5658980704
DaringMaid-13B-Q4_0.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:4f46e82d499be6789f5310dbde2163571ba7afd5723181700fa12d67f8389cf4
-size 7365835104
DaringMaid-13B-Q4_K_M.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:0340c137ae2ba3acca8e8b14bc5bea521c8397f1498913270a59fe5584672ce7
-size 7865956704
DaringMaid-13B-Q4_K_S.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:da6ca9f804c9ee9e38c10b82f4c4c7eff76fbef80ce71032550d6a52cf26e50b
-size 7423179104
DaringMaid-13B-Q5_0.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:c650228d7fa68d129dc8858b0d9a0c70248fac8deaa225104f066ac43d12ae25
-size 8972286304
DaringMaid-13B-Q5_K_M.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:81335b659715c31656d2bdd7a3301fbcc07031aa557bffa16708ee307312b0a9
-size 9229924704
DaringMaid-13B-Q5_K_S.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:f032169e849c9d58377d390108deb37ba0db3ae19c024316fc01698c419953ed
-size 8972286304
DaringMaid-13B-Q6_K.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:cf84628478c147627b9e66d7457130a074a029ac765ee0684a4ccb197e2a4a09
-size 10679140704
DaringMaid-13B-Q8_0.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:c6596750811b1bf031fd789a37b2580a1df745f33c0936caffc518005331df6a
-size 13831319904
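Each deleted file in this commit is a three-line Git LFS pointer (version, oid, size) rather than the model weights themselves. As a minimal sketch of how such a pointer could be read, the snippet below parses the Q3_K_L pointer text from the diff. The `LfsPointer` class and `parse_lfs_pointer` function are hypothetical names introduced for illustration and are not part of any Git LFS tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str  # URL of the pointer spec version
    oid: str      # hex digest, without the "sha256:" prefix
    size: int     # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the "key value" lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unexpected hash algorithm: {algo}")
    return LfsPointer(fields["version"], digest, int(fields["size"]))


# Pointer content of DaringMaid-13B-Q3_K_L.gguf, copied from the diff.
POINTER_Q3_K_L = """\
version https://git-lfs.github.com/spec/v1
oid sha256:df08d2648f142dfd93af0738a1a7ffe5b4fd7fb66bfef18cbc17923e88b16b6e
size 6929559904
"""

pointer = parse_lfs_pointer(POINTER_Q3_K_L)
print(f"{pointer.size / 1e9:.2f} GB")  # prints 6.93 GB
```

The `size` field is the size of the actual GGUF file, so summing it over the deleted pointers gives the storage freed by the commit.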