Instructions to use tensorblock/starchat-beta-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/starchat-beta-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/starchat-beta-GGUF", filename="starchat-beta-Q2_K.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use tensorblock/starchat-beta-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/starchat-beta-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/starchat-beta-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/starchat-beta-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/starchat-beta-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/starchat-beta-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/starchat-beta-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/starchat-beta-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/starchat-beta-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/starchat-beta-GGUF:Q2_K
- LM Studio
- Jan
- Ollama
How to use tensorblock/starchat-beta-GGUF with Ollama:
ollama run hf.co/tensorblock/starchat-beta-GGUF:Q2_K
- Unsloth Studio new
How to use tensorblock/starchat-beta-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/starchat-beta-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/starchat-beta-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/starchat-beta-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/starchat-beta-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/starchat-beta-GGUF:Q2_K
- Lemonade
How to use tensorblock/starchat-beta-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/starchat-beta-GGUF:Q2_K
Run and chat with the model
lemonade run user.starchat-beta-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
Browse files- starchat-beta-Q3_K_L.gguf +0 -3
- starchat-beta-Q3_K_M.gguf +0 -3
- starchat-beta-Q3_K_S.gguf +0 -3
- starchat-beta-Q4_0.gguf +0 -3
- starchat-beta-Q4_K_M.gguf +0 -3
- starchat-beta-Q4_K_S.gguf +0 -3
- starchat-beta-Q5_0.gguf +0 -3
- starchat-beta-Q5_K_M.gguf +0 -3
- starchat-beta-Q5_K_S.gguf +0 -3
- starchat-beta-Q6_K.gguf +0 -3
- starchat-beta-Q8_0.gguf +0 -3
starchat-beta-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:3fa00393b5ddec9b530052004e3d3a1c0372bb4711b9ed0cb5f08944ae60aea6
|
| 3 |
-
size 9132135968
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q3_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:44f4ffcefdf19c22b17e4be661bebfc3734ed5c7c928954fd2a690225ea25eb6
|
| 3 |
-
size 8226166304
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:a712239d435cf5d1ceef174950b7bc56f41979a534d5a2492350906d87655074
|
| 3 |
-
size 6976820768
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:5208aa19dddcb75908a06d76f1334b653fe0e16f83980b8856b42b93bb30d7e7
|
| 3 |
-
size 8990578208
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:2b8f70bd6c3f6ca7a21a09c43e321e1856374a3aea3d4a88775f1e0892d00eb7
|
| 3 |
-
size 9965753888
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:a3600df710d82fe1e4976fdd8aaea8202c5ed00e66a027bc76450e44a7824f15
|
| 3 |
-
size 9084950048
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:4f30f4cf34d3c4eb08da822df61dffe9644fc11919489eaf6ccd2cbafd2baeb8
|
| 3 |
-
size 10885879328
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:b2c8144234cb739a2f0f2b44a954302b3b273ec6c6aee790a62113045973b7be
|
| 3 |
-
size 11495855648
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:e0712cc53631ea7940db5466cb22e4a25dd63d8339fb4e8267b44e320f51f20d
|
| 3 |
-
size 10885879328
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:0461aaa7490c3d9c82ef25a70d0a4f0489f0f0af5deb7762f5f0522d1af50e9b
|
| 3 |
-
size 12899636768
|
|
|
|
|
|
|
|
|
|
|
|
starchat-beta-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:ad083b79281c74bcd29544a7f79d6d26ac685f6f65e78570d6249a75c7284be7
|
| 3 |
-
size 16644926816
|
|
|
|
|
|
|
|
|
|
|
|