athirdpath commited on
Commit
efcea59
·
1 Parent(s): b3dfbc3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -0
README.md ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ EDIT: Works pretty well for a model with no finetuning, has promise. Better and lighter than the 14b.
5
+
6
+ A 13b Mistral base model, based on the NeverSleep recipe. We've had second Mistral, why not third Mistral?
7
+
8
+ ### Recipe
9
+
10
+ slices
11
+
12
+ - sources:
13
+ -
14
+ - model: mistralai/Mistral-7B-v0.1
15
+ -
16
+ layer_range: [0, 24]
17
+
18
+ - sources:
19
+ -
20
+ - model: mistralai/Mistral-7B-v0.1
21
+ -
22
+ layer_range: [12, 24]
23
+
24
+ - sources:
25
+ -
26
+ - model: mistralai/Mistral-7B-v0.1
27
+ -
28
+ layer_range: [8, 32]
29
+
30
+ merge_method: passthrough
31
+
32
+ dtype: bfloat16