Envoid commited on
Commit
9c88d47
·
1 Parent(s): 7b5af03

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -5,6 +5,8 @@ license: cc-by-nc-4.0
5
 
6
  This model was created by taking [Libra19B](https://huggingface.co/Envoid/Libra19B) and then using the [frankenllama script](https://huggingface.co/chargoddard/llama2-22b) to perform a block diagonal merge with [Enterredaas 33B](https://huggingface.co/Aeala/Enterredaas-33b).
7
 
 
 
8
  ## Unnatural corpus:
9
 
10
  I then used the included autocorpus.py script to generate 20 megabytes of raw text samples using Libra19B
 
5
 
6
  This model was created by taking [Libra19B](https://huggingface.co/Envoid/Libra19B) and then using the [frankenllama script](https://huggingface.co/chargoddard/llama2-22b) to perform a block diagonal merge with [Enterredaas 33B](https://huggingface.co/Aeala/Enterredaas-33b).
7
 
8
+ Unfortunately due to the lack of GQA **it does not** fit on a single 24 Gigabyte GPU at 4096 context and thus all testing was done with only 55 layers offloaded to GPU via q4_K_M gguf format. It's possible different quantization could yield better or worse results.
9
+
10
  ## Unnatural corpus:
11
 
12
  I then used the included autocorpus.py script to generate 20 megabytes of raw text samples using Libra19B