Instructions to use tensorblock/slim-ner-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/slim-ner-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/slim-ner-GGUF", filename="slim-ner-Q2_K.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use tensorblock/slim-ner-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/slim-ner-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/slim-ner-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/slim-ner-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/slim-ner-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/slim-ner-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/slim-ner-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/slim-ner-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/slim-ner-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/slim-ner-GGUF:Q2_K
- LM Studio
- Jan
- Ollama
How to use tensorblock/slim-ner-GGUF with Ollama:
ollama run hf.co/tensorblock/slim-ner-GGUF:Q2_K
- Unsloth Studio
How to use tensorblock/slim-ner-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/slim-ner-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/slim-ner-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/slim-ner-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/slim-ner-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/slim-ner-GGUF:Q2_K
- Lemonade
How to use tensorblock/slim-ner-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/slim-ner-GGUF:Q2_K
Run and chat with the model
lemonade run user.slim-ner-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
Browse files- slim-ner-Q3_K_L.gguf +0 -3
- slim-ner-Q3_K_M.gguf +0 -3
- slim-ner-Q3_K_S.gguf +0 -3
- slim-ner-Q4_0.gguf +0 -3
- slim-ner-Q4_K_M.gguf +0 -3
- slim-ner-Q4_K_S.gguf +0 -3
- slim-ner-Q5_0.gguf +0 -3
- slim-ner-Q5_K_M.gguf +0 -3
- slim-ner-Q5_K_S.gguf +0 -3
- slim-ner-Q6_K.gguf +0 -3
- slim-ner-Q8_0.gguf +0 -3
slim-ner-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:7fcb635667256f605960d99141d56c6adca324b3d89b760292c27f34c32fb32d
|
| 3 |
-
size 591526848
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q3_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:f723bdb449db45b7e22428bfd794502989bff13ca20643c1e5f9046e156bac93
|
| 3 |
-
size 548404160
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:3ddb7e73463d1b9e440f6c2ea353fad561e65e4660ceef3fe5aa15d499ceabab
|
| 3 |
-
size 499342272
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:25cfbb8291a583b55aa3cf29cf47d125f2a51edddff0fa735066dcea281158e9
|
| 3 |
-
size 636726208
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:37305a7c3755388218520466b2ab76050a53f352962032942d1f72462cfadecf
|
| 3 |
-
size 667814848
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:3e224c51323e9715a520c38eccc18ed7f531241b564a495c792faeb110852bfb
|
| 3 |
-
size 639871936
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:efbbf81a7d0c37ba7c15d7bb1c533cd2468071316173b231a7f426fa1b54fdbd
|
| 3 |
-
size 766028736
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:c61d83dbcbe9e16f17acca312e0b4c372ddb92d98258447f60b3212f0323dfb4
|
| 3 |
-
size 782044096
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:6eb1ec385166de64d87c787ce6662cb555f36994a2af131f47eda3e01bb1f684
|
| 3 |
-
size 766028736
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:6412a96e50ce9c839d31ff7393e3852554340d0edda42536d79ab172b88c382e
|
| 3 |
-
size 903412672
|
|
|
|
|
|
|
|
|
|
|
|
slim-ner-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:96cc9525e1d52ead8b5d5c94f8e9092bf83bb950a3331c8eb19a7ca36a3d9e69
|
| 3 |
-
size 1169808320
|
|
|
|
|
|
|
|
|
|
|
|