How to use with llama.cpp
Install from Homebrew (macOS/Linux)
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf snakech/cot_5k-GGUF:Q2_K
# Run inference directly in the terminal:
llama-cli -hf snakech/cot_5k-GGUF:Q2_K
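Once the server is up, you can query its OpenAI-compatible API from any HTTP client. A minimal sketch with curl, assuming the server's default address of http://localhost:8080 (the message text is illustrative):

# Send a chat completion request to the local server:
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello!"}]}'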
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf snakech/cot_5k-GGUF:Q2_K
# Run inference directly in the terminal:
llama-cli -hf snakech/cot_5k-GGUF:Q2_K
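llama-cli can also run a single non-interactive generation. A minimal sketch using its standard -p (prompt) and -n (number of tokens to generate) flags; the prompt text is illustrative:

# Generate up to 128 tokens for one prompt, then exit:
llama-cli -hf snakech/cot_5k-GGUF:Q2_K -p "Explain GGUF in one sentence." -n 128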
Use a pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf snakech/cot_5k-GGUF:Q2_K
# Run inference directly in the terminal:
./llama-cli -hf snakech/cot_5k-GGUF:Q2_K
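The server's bind address and context window can be changed on the command line. A sketch using llama-server's --host, --port, and -c flags; the values shown are illustrative:

# Serve on all interfaces, port 8080, with a 4096-token context window:
./llama-server -hf snakech/cot_5k-GGUF:Q2_K --host 0.0.0.0 --port 8080 -c 4096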
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf snakech/cot_5k-GGUF:Q2_K
# Run inference directly in the terminal:
./build/bin/llama-cli -hf snakech/cot_5k-GGUF:Q2_K
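The commands above produce a CPU-only build. If you have an NVIDIA GPU and the CUDA toolkit installed, you can enable the CUDA backend at configure time via the GGML_CUDA CMake option (other backends such as Vulkan or Metal have analogous options):

# Optional: rebuild with CUDA support (requires the CUDA toolkit):
cmake -B build -DGGML_CUDA=ON
cmake --build build -j --target llama-server llama-cli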
Use Docker
docker model run hf.co/snakech/cot_5k-GGUF:Q2_K
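As an alternative to Docker Model Runner, you can serve a locally downloaded GGUF file with the llama.cpp server container. A sketch assuming the ghcr.io/ggml-org/llama.cpp:server image and a model file already in the current directory (see the download instructions below):

# Serve a local GGUF file with the llama.cpp server image:
docker run -v $(pwd):/models -p 8080:8080 ghcr.io/ggml-org/llama.cpp:server \
  -m /models/cot_5k-Q2_K.gguf --host 0.0.0.0 --port 8080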

FabienRoger/cot_5k - GGUF

This repo contains GGUF format model files for FabienRoger/cot_5k.

These files are compatible with llama.cpp as of commit b4011.

Prompt template

<|system|>
{system_prompt}<|endoftext|>
<|user|>
{prompt}<|endoftext|>
<|assistant|>
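llama-server's chat endpoint applies the model's chat template automatically, but when passing a raw prompt to llama-cli you substitute {system_prompt} and {prompt} yourself. A minimal sketch with illustrative system and user messages (note the literal newlines inside the quoted prompt):

llama-cli -hf snakech/cot_5k-GGUF:Q2_K -n 128 -p "<|system|>
You are a helpful assistant.<|endoftext|>
<|user|>
What is GGUF?<|endoftext|>
<|assistant|>
"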

Model file specification

Filename          Quant type  File Size  Description
cot_5k-Q2_K.gguf  Q2_K        0.646 GB   smallest, significant quality loss - not recommended for most purposes

Downloading instructions

Command line

First, install the Hugging Face Hub client:

pip install -U "huggingface_hub[cli]"

Then, download the individual model file to a local directory:

huggingface-cli download snakech/cot_5k-GGUF --include "cot_5k-Q2_K.gguf" --local-dir MY_LOCAL_DIR
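After the download finishes, you can point llama.cpp at the local file with -m instead of fetching by repo id. A minimal sketch, reusing MY_LOCAL_DIR from the command above (prompt text illustrative):

# Run inference against the locally downloaded file:
llama-cli -m MY_LOCAL_DIR/cot_5k-Q2_K.gguf -p "Hello" -n 64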

If you want to download multiple model files matching a pattern (e.g., *Q2_K*gguf), you can try:

huggingface-cli download snakech/cot_5k-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q2_K*gguf'
Model details

Format: GGUF
Model size: 2B params
Architecture: stablelm