Original model: https://huggingface.co/Deci/DeciLM-7B-Instruct
```
I [modified llama.cpp](https://github.com/ymcki/llama.cpp-b4139) to support DeciLMCausalModel's variable Grouped Query Attention. Please download and compile it to run the GGUFs in this repository.
Please note that the HF model of DeciLM-7B-Instruct uses dynamic NTK-aware RoPE scaling. However, llama.cpp doesn't support it yet, so my modification simply ignores the dynamic NTK-aware RoPE scaling setting in config.json. Since the GGUFs seem to work, please use them as is for the time being, until I figure out how to implement dynamic NTK-aware RoPE scaling.
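For context on what is being ignored: dynamic NTK-aware scaling leaves RoPE unchanged within the trained context length and rescales the base frequency once the sequence grows past it. The sketch below follows the rescaling formula used by HF transformers' dynamic NTK rotary embedding; the function name and example numbers are mine, not from this repository:

```python
import math


def dynamic_ntk_base(base: float, dim: int, seq_len: int,
                     max_position_embeddings: int, factor: float) -> float:
    """Recompute the RoPE base frequency under dynamic NTK-aware scaling.

    Within the trained context length the base is untouched; beyond it,
    the base grows so that high-frequency rotary dimensions are preserved
    while low-frequency ones are interpolated.
    """
    if seq_len <= max_position_embeddings:
        return base  # plain RoPE inside the trained context
    scale = (factor * seq_len / max_position_embeddings) - (factor - 1)
    return base * scale ** (dim / (dim - 2))


# Example: a head dim of 128, trained context 4096, scaling factor 4.0.
# At 2048 tokens nothing changes; at 8192 tokens the base is inflated.
print(dynamic_ntk_base(10000.0, 128, 2048, 4096, 4.0))
print(dynamic_ntk_base(10000.0, 128, 8192, 4096, 4.0))
```

Because the modified llama.cpp skips this adjustment, the GGUFs behave like plain RoPE models; that is harmless at or below the trained context length, which is why they "seem to work" as is.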
## Download a file (not the whole branch) from below: