nisten commited on
Commit
1bb83a8
·
1 Parent(s): 4bcb440

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md CHANGED
@@ -1,3 +1,31 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+
5
+ **Undi95 type frankenstein of TinyLLama 1.1b**
6
+ https://github.com/jzhang38/TinyLlama
7
+ https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0
8
+
9
+ **GGUF custom quants included**
10
+
11
+ The secret sauce:
12
+
13
+ ```bash
14
+ slices:
15
+ - sources:
16
+ - model: "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
17
+ layer_range: [0, 22]
18
+ - sources:
19
+ - model: "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
20
+ layer_range: [8, 32]
21
+ merge_method: passthrough
22
+ dtype: bfloat16
23
+ ```
24
+
25
+ How to run as gguf:
26
+
27
+ ```bash
28
+ git clone https://github.com/ggerganov/llama.cpp && cd llama.cpp && make -j
29
+ wget https://huggingface.co/SkunkworksAI/tinyfrank-1.4B/resolve/main/tinyfrank-q6L.gguf
30
+ ./server -m tinyfrank-q6L.gguf --host "my.internal.ip.or.my.cloud.host.name.goes.here.com" -c 512
31
+ ```