Instructions to use tensorblock/tsum_base-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use tensorblock/tsum_base-GGUF with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("tensorblock/tsum_base-GGUF", dtype="auto") - llama-cpp-python
How to use tensorblock/tsum_base-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/tsum_base-GGUF", filename="tsum_base-Q2_K.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use tensorblock/tsum_base-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/tsum_base-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/tsum_base-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/tsum_base-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/tsum_base-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/tsum_base-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/tsum_base-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/tsum_base-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/tsum_base-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/tsum_base-GGUF:Q2_K
- LM Studio
- Jan
- Ollama
How to use tensorblock/tsum_base-GGUF with Ollama:
ollama run hf.co/tensorblock/tsum_base-GGUF:Q2_K
- Unsloth Studio new
How to use tensorblock/tsum_base-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/tsum_base-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/tsum_base-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/tsum_base-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/tsum_base-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/tsum_base-GGUF:Q2_K
- Lemonade
How to use tensorblock/tsum_base-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/tsum_base-GGUF:Q2_K
Run and chat with the model
lemonade run user.tsum_base-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
Browse files- tsum_base-Q3_K_L.gguf +0 -3
- tsum_base-Q3_K_M.gguf +0 -3
- tsum_base-Q3_K_S.gguf +0 -3
- tsum_base-Q4_0.gguf +0 -3
- tsum_base-Q4_K_M.gguf +0 -3
- tsum_base-Q4_K_S.gguf +0 -3
- tsum_base-Q5_0.gguf +0 -3
- tsum_base-Q5_K_M.gguf +0 -3
- tsum_base-Q5_K_S.gguf +0 -3
- tsum_base-Q6_K.gguf +0 -3
- tsum_base-Q8_0.gguf +0 -3
tsum_base-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:27d290583ddfb33ab1f2485d8e2e695b03df4c9b7b73fdcd02248ba7764fe70c
|
| 3 |
-
size 4321960992
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q3_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:c67dfe0eca57f4588f2845d6e887dcb1b8fbc418dbdfbe211bb969392cb9978a
|
| 3 |
-
size 4018922528
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:0c9a591043e713a663c6bceace128dae567510d96a4170259eb20f1cfbfb35ce
|
| 3 |
-
size 3664503840
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:d2d126a3e81ea3fd151afe2ff3011f25065f03c883ae69f0f83371d4d6532a30
|
| 3 |
-
size 4661216288
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:e8f131655fbb67d0305dc2cc02ef049ce0476e449032a827eebc6a569718ed18
|
| 3 |
-
size 4920738848
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:8453a872b5bb1ae2493700ca6889bc1e5681f55bae32feacd1fa2bc57de50891
|
| 3 |
-
size 4692673568
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:b3adc71c6029a9681549468c4821dabe23f8cddae8182ed2ec94b335e0903f9f
|
| 3 |
-
size 5599298592
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:c968d0b664b9ec6f757020d6f31719289cfe9f402d8b56fc6359f3c6e4ed6ce0
|
| 3 |
-
size 5732992032
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:adac3406206e9755e59b8df5f29b1f89fcf72daa07e9421fbf4af2aa78a2e1fe
|
| 3 |
-
size 5599298592
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:f15ee081006c470f50a1d85e95b112fbb871ee63f4f4df3f73cb13c23e554fdd
|
| 3 |
-
size 6596011040
|
|
|
|
|
|
|
|
|
|
|
|
tsum_base-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:a09599b885d6cc98198749c1a3d9df0213fc18e8b8cb4da4fd7436f8006b6673
|
| 3 |
-
size 8540775456
|
|
|
|
|
|
|
|
|
|
|
|