## How to use from llama.cpp

### Install from WinGet (Windows)
```sh
winget install llama.cpp

# Start a local OpenAI-compatible server with a web UI:
llama-server -hf afrideva/tinyllama-python-GGUF

# Run inference directly in the terminal:
llama-cli -hf afrideva/tinyllama-python-GGUF
```

### Use a pre-built binary
```sh
# Download a pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases

# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf afrideva/tinyllama-python-GGUF

# Run inference directly in the terminal:
./llama-cli -hf afrideva/tinyllama-python-GGUF
```

### Build from source code
```sh
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli

# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf afrideva/tinyllama-python-GGUF

# Run inference directly in the terminal:
./build/bin/llama-cli -hf afrideva/tinyllama-python-GGUF
```

### Use Docker
```sh
docker model run hf.co/afrideva/tinyllama-python-GGUF
```

### Install from brew

```sh
brew install llama.cpp

# Start a local OpenAI-compatible server with a web UI:
llama-server -hf afrideva/tinyllama-python-GGUF

# Run inference directly in the terminal:
llama-cli -hf afrideva/tinyllama-python-GGUF
```
## rahuldshetty/tinyllama-python-GGUF

Quantized GGUF model files for tinyllama-python from rahuldshetty.
| Name | Quant method | Size |
|---|---|---|
| tinyllama-python.fp16.gguf | fp16 | 2.20 GB |
| tinyllama-python.q2_k.gguf | q2_k | 432.13 MB |
| tinyllama-python.q3_k_m.gguf | q3_k_m | 548.40 MB |
| tinyllama-python.q4_k_m.gguf | q4_k_m | 667.81 MB |
| tinyllama-python.q5_k_m.gguf | q5_k_m | 782.04 MB |
| tinyllama-python.q6_k.gguf | q6_k | 903.41 MB |
| tinyllama-python.q8_0.gguf | q8_0 | 1.17 GB |
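
To pin one specific file from the table above instead of letting llama.cpp pick one, you can fetch it with the `huggingface_hub` Python package; the resulting local path can then be passed to `llama-cli`/`llama-server` via `-m`. A minimal sketch (the chosen filename here is just one row from the table):

```python
# Sketch: download one specific quantization from the table above
# using huggingface_hub (pip install huggingface_hub).
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="afrideva/tinyllama-python-GGUF",
    filename="tinyllama-python.q4_k_m.gguf",  # any filename from the table
)
print(model_path)  # local path, usable as: llama-cli -m <model_path>
```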
## Original Model Card: rahuldshetty/tinyllama-python-gguf
- Base model: unsloth/tinyllama-bnb-4bit
- Dataset: iamtarun/python_code_instructions_18k_alpaca
- Training script: unslothai notebook "Alpaca + TinyLlama + RoPE Scaling full example.ipynb"
### Prompt Format

```
### Instruction:
{instruction}

### Response:
```
### Example

```
### Instruction:
Write a function to find cube of a number.

### Response:
```
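
Because `llama-server` exposes an OpenAI-compatible API, the prompt format above can be exercised from Python. A hedged sketch using the `requests` package; the port 8080 and the `/v1/completions` route are llama.cpp defaults at the time of writing and may differ in your setup:

```python
# Sketch: query a locally running llama-server with the Alpaca-style
# prompt format this model was trained on. Assumes the server was
# started as shown above and listens on the default port 8080.
import requests

prompt = (
    "### Instruction:\n"
    "Write a function to find cube of a number.\n\n"
    "### Response:\n"
)

resp = requests.post(
    "http://localhost:8080/v1/completions",  # OpenAI-compatible endpoint
    json={"prompt": prompt, "max_tokens": 256, "temperature": 0.2},
)
resp.raise_for_status()
print(resp.json()["choices"][0]["text"])
```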
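
The GGUF files can also be loaded in-process rather than through the llama.cpp CLI tools. As one example not mentioned in the card, the `llama-cpp-python` bindings can download a quantization from this repo and run the Alpaca-style prompt directly:

```python
# Sketch: in-process inference with llama-cpp-python
# (pip install llama-cpp-python). The filename is one row
# from the quantization table above.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="afrideva/tinyllama-python-GGUF",
    filename="tinyllama-python.q4_k_m.gguf",
)

prompt = (
    "### Instruction:\n"
    "Write a function to find cube of a number.\n\n"
    "### Response:\n"
)

out = llm(prompt, max_tokens=256, temperature=0.2)
print(out["choices"][0]["text"])
```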