Instructions to use flpelerin/mlk-models with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use flpelerin/mlk-models with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="flpelerin/mlk-models", filename="run-0a23d4bc/run-0a23d4bc.q8_0.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use flpelerin/mlk-models with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf flpelerin/mlk-models:Q8_0 # Run inference directly in the terminal: llama-cli -hf flpelerin/mlk-models:Q8_0
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf flpelerin/mlk-models:Q8_0 # Run inference directly in the terminal: llama-cli -hf flpelerin/mlk-models:Q8_0
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf flpelerin/mlk-models:Q8_0 # Run inference directly in the terminal: ./llama-cli -hf flpelerin/mlk-models:Q8_0
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf flpelerin/mlk-models:Q8_0 # Run inference directly in the terminal: ./build/bin/llama-cli -hf flpelerin/mlk-models:Q8_0
Use Docker
docker model run hf.co/flpelerin/mlk-models:Q8_0
- LM Studio
- Jan
- Ollama
How to use flpelerin/mlk-models with Ollama:
ollama run hf.co/flpelerin/mlk-models:Q8_0
- Unsloth Studio
How to use flpelerin/mlk-models with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for flpelerin/mlk-models to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for flpelerin/mlk-models to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for flpelerin/mlk-models to start chatting
- Atomic Chat new
- Docker Model Runner
How to use flpelerin/mlk-models with Docker Model Runner:
docker model run hf.co/flpelerin/mlk-models:Q8_0
- Lemonade
How to use flpelerin/mlk-models with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull flpelerin/mlk-models:Q8_0
Run and chat with the model
lemonade run user.mlk-models-Q8_0
List all available models
lemonade list
flpelerin commited on
Commit ·
d3b629d
1
Parent(s): cbdbf75
added run-999de574
Browse files- .gitattributes +1 -0
- run-999de574/merged/config.json +3 -0
- run-999de574/merged/generation_config.json +3 -0
- run-999de574/merged/pytorch_model.bin +3 -0
- run-999de574/merged/special_tokens_map.json +3 -0
- run-999de574/merged/tokenizer.json +3 -0
- run-999de574/merged/tokenizer.model +3 -0
- run-999de574/merged/tokenizer_config.json +3 -0
- run-999de574/run-999de574.q8_0.gguf +3 -0
.gitattributes
CHANGED
|
@@ -15,3 +15,4 @@ run-4f8286af/** filter=lfs diff=lfs merge=lfs -text
|
|
| 15 |
run-e65a5e1f/** filter=lfs diff=lfs merge=lfs -text
|
| 16 |
run-7c381d80/** filter=lfs diff=lfs merge=lfs -text
|
| 17 |
run-69727a9c/** filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 15 |
run-e65a5e1f/** filter=lfs diff=lfs merge=lfs -text
|
| 16 |
run-7c381d80/** filter=lfs diff=lfs merge=lfs -text
|
| 17 |
run-69727a9c/** filter=lfs diff=lfs merge=lfs -text
|
| 18 |
+
run-999de574/** filter=lfs diff=lfs merge=lfs -text
|
run-999de574/merged/config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:195cdbb348872e0207ecc298dd81a52cfa283f6aaf34645cf5cbdf135c31a67b
|
| 3 |
+
size 717
|
run-999de574/merged/generation_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7d7881c20389e36b19d8fcb153ea7ac06dff39cd883501d73b986012f4f3cbc8
|
| 3 |
+
size 145
|
run-999de574/merged/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:90f8d462e4b6ea6074876b4d01ec717b8883b7997744545d537daace22efa5fd
|
| 3 |
+
size 2200161902
|
run-999de574/merged/special_tokens_map.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:82d96d7a9e6ced037f12394b7ea6a5b02e6ca87e0d11edaa8d60d9be857ce7db
|
| 3 |
+
size 551
|
run-999de574/merged/tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bf467c9e0f536bda271283c6ef85eb1a943e3196b621c8a912d64953b205df83
|
| 3 |
+
size 1842795
|
run-999de574/merged/tokenizer.model
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
|
| 3 |
+
size 499723
|
run-999de574/merged/tokenizer_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e6dddb0d77c54064c48f7280f9879e2286c1b547be9694b06c912522022d6933
|
| 3 |
+
size 1321
|
run-999de574/run-999de574.q8_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47e609106e647e5968d4cbff9929dbb24836f493c8ddaf2bfe0e2caf324e5f79
|
| 3 |
+
size 1169808448
|