Instructions to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/Kukedlc_LLama-3-8b-Python-GGUF", filename="LLama-3-8b-Python-Q2_K.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
- LM Studio
- Jan
- Ollama
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with Ollama:
ollama run hf.co/tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
- Unsloth Studio new
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/Kukedlc_LLama-3-8b-Python-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/Kukedlc_LLama-3-8b-Python-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/Kukedlc_LLama-3-8b-Python-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
- Lemonade
How to use tensorblock/Kukedlc_LLama-3-8b-Python-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/Kukedlc_LLama-3-8b-Python-GGUF:Q2_K
Run and chat with the model
lemonade run user.Kukedlc_LLama-3-8b-Python-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
Browse files- LLama-3-8b-Python-Q3_K_L.gguf +0 -3
- LLama-3-8b-Python-Q3_K_M.gguf +0 -3
- LLama-3-8b-Python-Q3_K_S.gguf +0 -3
- LLama-3-8b-Python-Q4_0.gguf +0 -3
- LLama-3-8b-Python-Q4_K_M.gguf +0 -3
- LLama-3-8b-Python-Q4_K_S.gguf +0 -3
- LLama-3-8b-Python-Q5_0.gguf +0 -3
- LLama-3-8b-Python-Q5_K_M.gguf +0 -3
- LLama-3-8b-Python-Q5_K_S.gguf +0 -3
- LLama-3-8b-Python-Q6_K.gguf +0 -3
- LLama-3-8b-Python-Q8_0.gguf +0 -3
LLama-3-8b-Python-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:4a84324e06630bb019bbfafe9d1770bd41cab95858324b3402e0f927eff857ad
|
| 3 |
-
size 4321956448
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q3_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:42fe018b4bc5860ad255187ef6cf25e39ca3e37fe7a8ed88daa82772397986ea
|
| 3 |
-
size 4018917984
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:5d061df2e7dd40aa0036edfa662cc562491a5ce67778559735a26e4b7a4ce1b6
|
| 3 |
-
size 3664499296
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:5cd01adc7acd66566246daaf04f13eb250f0d2de7d6ab71465fced51f1f3f648
|
| 3 |
-
size 4661211744
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:0fe07b4173dc8e69a0b68a7a174555ba08011547e6524cbb3938752ae9deb7cf
|
| 3 |
-
size 4920734304
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:bb4987d38e17c423b150a3a9091f71114e2fc7637b2f621c6a1d6ff9f7a86094
|
| 3 |
-
size 4692669024
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:ea998709fca71b48aa263e78e655944e6334e799dd9b0e5778ac29effd8f4d99
|
| 3 |
-
size 5599294048
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:2fff51a33fb4da7dd466aab5f3c8ce444b04f5655379ac01b44d12299dbd6e7e
|
| 3 |
-
size 5732987488
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:f425497f2dc5212f04d0813a17c6c73ea6103e1ab6bb1d39e2b59153f191a04a
|
| 3 |
-
size 5599294048
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:e7dd8046fca771fb1630b1b833027f21eb98bd9f769be87f610ca73cda9cde3f
|
| 3 |
-
size 6596006496
|
|
|
|
|
|
|
|
|
|
|
|
LLama-3-8b-Python-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:33eae054428ebfe0ac577142724ae5c958a1fe183872e781a1cae9e46a5aee42
|
| 3 |
-
size 8540770912
|
|
|
|
|
|
|
|
|
|
|
|