Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -29,8 +29,8 @@ Brought to you by the [LocalAI](https://github.com/mudler/LocalAI) team. These m
|
|
| 29 |
|------|--------------|-------------|
|
| 30 |
| `ggml-model-f16.bin` | float16 | Full precision (no quantization) — highest quality |
|
| 31 |
| `ggml-model-q8_0.bin` | int8 | 8-bit quantization — minimal quality loss |
|
| 32 |
-
| `ggml-model-
|
| 33 |
-
| `ggml-model-
|
| 34 |
|
| 35 |
## Training
|
| 36 |
|
|
|
|
| 29 |
|------|--------------|-------------|
|
| 30 |
| `ggml-model-f16.bin` | float16 | Full precision (no quantization) — highest quality |
|
| 31 |
| `ggml-model-q8_0.bin` | int8 | 8-bit quantization — minimal quality loss |
|
| 32 |
+
| `ggml-model-q5_0.bin` | int5 | 5-bit quantization — good quality/size tradeoff |
|
| 33 |
+
| `ggml-model-q4_0.bin` | int4 | 4-bit quantization — smallest size, fastest |
|
| 34 |
|
| 35 |
## Training
|
| 36 |
|