dranger003 commited on
Commit
2c8127e
·
verified ·
1 Parent(s): b40e4e6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -1,3 +1,11 @@
1
  ---
2
  license: cc-by-nc-4.0
 
 
3
  ---
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-4.0
3
+ pipeline_tag: text-generation
4
+ library_name: gguf
5
  ---
6
+ GGUF importance matrix (imatrix) quants for https://huggingface.co/mlabonne/NeuralMonarch-7B
7
+ The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a [general purpose imatrix calibration dataset](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).
8
+
9
+ | Layers | Context | Template |
10
+ | --- | --- | --- |
11
+ | <pre>32</pre> | <pre>32768</pre> | <pre>\<s\>user<br>{prompt}\</s\><br>\<s\>assistant<br>{response}</pre> |