This repository contains GGUF weights for Pavariss/DeepSeek-R1-ThaiInsurance-COT-Demo1.
For a convenient overview and download list, visit our model page.
If you are unsure how to use GGUF files, refer to the llama.cpp documentation for more details.
./llama-cli -m DeepSeek-R1-ThaiInsurance-COT-Demo1-q4_k_m.gguf -p "Hello!"
Available quantizations (sorted by file size, which does not necessarily track quality):
| Link | Type | Size/GB | Notes |
|---|---|---|---|
| GGUF | q2_k | 2.96 | very low quality, for testing |
| GGUF | q3_k_m | 3.74 | |
| GGUF | q4_0 | 4.34 | |
| GGUF | q4_k_m | 4.58 | recommended, good balance |
| GGUF | q5_k_m | 5.34 | |
| GGUF | q8_0 | 7.95 | near-full precision |
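
If you are choosing a file by available disk space, the table above can be turned into a small lookup. The following is a minimal sketch, not part of this repository: `pick_quant` is a hypothetical helper name, and the sizes are copied from the table. It considers file size only; running the model needs additional memory for the context, so leave headroom.

```shell
# pick_quant BUDGET_GB
# Hypothetical helper: prints the largest quantization type from the table
# whose file size (in GB) is at most BUDGET_GB. Returns non-zero if none fits.
pick_quant() {
  awk -v budget="$1" '
    # Keep the largest size seen so far that is still within the budget.
    $2 <= budget && $2 > best { best = $2; name = $1 }
    END { if (name != "") print name; else exit 1 }
  ' <<'EOF'
q2_k 2.96
q3_k_m 3.74
q4_0 4.34
q4_k_m 4.58
q5_k_m 5.34
q8_0 7.95
EOF
}

# Example: largest listed file that fits in 5 GB.
pick_quant 5
```

With a 5 GB budget this selects `q4_k_m`, which matches the recommended entry in the table.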
Special thanks to the llama.cpp team for their amazing work.
Base model: Pavariss/DeepSeek-R1-ThaiInsurance-COT-Demo1