Upload README.md with huggingface_hub
README.md
CHANGED
@@ -16,7 +16,7 @@ quantized_by: Sumitc13
 
 GGUF quantizations of [sarvamai/sarvam-30b](https://huggingface.co/sarvamai/sarvam-30b) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp).
 
-> **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #
+> **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #20275](https://github.com/ggml-org/llama.cpp/pull/20275) or build from the [add-sarvam-moe branch](https://github.com/sumitchatterjee13/llama.cpp/tree/add-sarvam-moe).
 
 ## Available Quantizations
 