Sumitc13
/

sarvam-30b-GGUF

@@ -16,7 +16,7 @@ quantized_by: Sumitc13
 GGUF quantizations of [sarvamai/sarvam-30b](https://huggingface.co/sarvamai/sarvam-30b) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp).
-> **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #pending](https://github.com/ggml-org/llama.cpp/pull/) or build from the [add-sarvam-moe branch](https://github.com/sumitchatterjee13/llama.cpp/tree/add-sarvam-moe).
 ## Available Quantizations

 GGUF quantizations of [sarvamai/sarvam-30b](https://huggingface.co/sarvamai/sarvam-30b) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp).
+> **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #20275](https://github.com/ggml-org/llama.cpp/pull/20275) or build from the [add-sarvam-moe branch](https://github.com/sumitchatterjee13/llama.cpp/tree/add-sarvam-moe).
 ## Available Quantizations