Sumitc13 commited on
Commit
cee1078
·
verified ·
1 Parent(s): 82eda13

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ quantized_by: Sumitc13
16
 
17
  GGUF quantizations of [sarvamai/sarvam-30b](https://huggingface.co/sarvamai/sarvam-30b) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp).
18
 
19
- > **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #pending](https://github.com/ggml-org/llama.cpp/pull/) or build from the [add-sarvam-moe branch](https://github.com/sumitchatterjee13/llama.cpp/tree/add-sarvam-moe).
20
 
21
  ## Available Quantizations
22
 
 
16
 
17
  GGUF quantizations of [sarvamai/sarvam-30b](https://huggingface.co/sarvamai/sarvam-30b) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp).
18
 
19
+ > **Note:** This model requires a custom build of llama.cpp with `sarvam_moe` architecture support. See [PR #20275](https://github.com/ggml-org/llama.cpp/pull/20275) or build from the [add-sarvam-moe branch](https://github.com/sumitchatterjee13/llama.cpp/tree/add-sarvam-moe).
20
 
21
  ## Available Quantizations
22