---
base_model:
- Pavariss/DeepSeek-R1-ThaiInsurance-COT-Demo1
---

# DeepSeek-R1-ThaiInsurance-COT-Demo1 - GGUF

## About

This repository contains GGUF weights for [Pavariss/DeepSeek-R1-ThaiInsurance-COT-Demo1](https://huggingface.co/Pavariss/DeepSeek-R1-ThaiInsurance-COT-Demo1).

For a convenient overview and download list, visit our [model page](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1).

## Usage

If you are unsure how to use GGUF files, refer to the [llama.cpp documentation](https://github.com/ggerganov/llama.cpp) for more details.

### Llama.cpp CLI

```bash
./llama-cli -m DeepSeek-R1-ThaiInsurance-COT-Demo1-q4_k_m.gguf -p "Hello!"
```
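
To fetch a quant before running the command above, one option is a direct download. The sketch below is a minimal example, assuming the files are served from the `sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1` repository under Hugging Face's usual `/resolve/main/` download path and follow the `<model>-<quant>.gguf` naming used in this card. The helper names `quant_file` and `quant_url` are hypothetical. The script only prints the URL; the download line itself is left commented out.

```bash
#!/usr/bin/env bash
# Assumed values, taken from the links in this model card.
REPO="sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1"
MODEL="DeepSeek-R1-ThaiInsurance-COT-Demo1"

# quant_file TYPE: print the GGUF file name for a quant type such as q4_k_m.
quant_file() {
  printf '%s-%s.gguf\n' "$MODEL" "$1"
}

# quant_url TYPE: print the direct-download URL (resolve/main rather than blob/main).
quant_url() {
  printf 'https://huggingface.co/%s/resolve/main/%s\n' "$REPO" "$(quant_file "$1")"
}

# Print the URL for the q4_k_m file.
quant_url q4_k_m

# Uncomment to download, then pass the file to llama-cli with -m:
# curl -L -o "$(quant_file q4_k_m)" "$(quant_url q4_k_m)"
```

Swapping `q4_k_m` for another quant type gives the URL of the matching file.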

## Provided Quants

(sorted by size, not necessarily quality)

| Link | Type | Size/GB | Notes |
| :--- | :--- | :---: | :--- |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q2_k.gguf) | q2_k | 2.96 | very low quality, for testing |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q3_k_m.gguf) | q3_k_m | 3.74 | |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q4_0.gguf) | q4_0 | 4.34 | |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q4_k_m.gguf) | q4_k_m | 4.58 | recommended, good balance |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q5_k_m.gguf) | q5_k_m | 5.34 | |
| [GGUF](https://huggingface.co/sthaps/DeepSeek-R1-ThaiInsurance-COT-Demo1/blob/main/DeepSeek-R1-ThaiInsurance-COT-Demo1-q8_0.gguf) | q8_0 | 7.95 | near-full precision |
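
Because the table is sorted by size, choosing a file for a given download or disk budget reduces to a simple scan. The sketch below is one hedged way to do that, assuming the sizes listed above are accurate. `largest_quant_under` is a hypothetical helper, and "largest file that fits" is only a heuristic, since the table is not ordered by quality.

```bash
#!/usr/bin/env bash
# Quant types and file sizes in GB, copied from the table above (ascending by size).
QUANTS="q2_k:2.96 q3_k_m:3.74 q4_0:4.34 q4_k_m:4.58 q5_k_m:5.34 q8_0:7.95"

# largest_quant_under BUDGET_GB: print the largest quant whose file size
# is at most BUDGET_GB, or nothing if no file fits.
largest_quant_under() {
  local budget="$1"
  # $QUANTS is intentionally unquoted so each "type:size" pair lands on its own line.
  printf '%s\n' $QUANTS |
    awk -F: -v b="$budget" '$2 + 0 <= b + 0 { best = $1 } END { if (best != "") print best }'
}

largest_quant_under 5   # prints q4_k_m
```

The budget here covers the file size only; it says nothing about runtime memory use.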

## Thanks

Special thanks to the `llama.cpp` team for their amazing work.