Commit c6d4830 (verified) by Sumail · 1 parent: 6b12cba

Update README.md

Files changed (1)
  1. README.md +0 -49
README.md CHANGED
@@ -1,49 +0,0 @@
- ---
- base_model:
- - deepnetguy/gemma-44
- - tomaszki/gemma-30
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # merge
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the SLERP merge method.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [deepnetguy/gemma-44](https://huggingface.co/deepnetguy/gemma-44)
- * [tomaszki/gemma-30](https://huggingface.co/tomaszki/gemma-30)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
-
- slices:
-   - sources:
-       - model: deepnetguy/gemma-44
-         layer_range: [0, 18]
-       - model: tomaszki/gemma-30
-         layer_range: [0, 18]
- merge_method: slerp
- base_model: deepnetguy/gemma-44
- parameters:
-   t:
-     - filter: self_attn
-       value: [0, 0.5, 0.3, 0.7, 1]
-     - filter: mlp
-       value: [1, 0.5, 0.7, 0.3, 0]
-     - value: 0.5
- dtype: bfloat16
-
- ```
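
For readers unfamiliar with the merge method named in the removed README, the following is a minimal, standard-library sketch of spherical linear interpolation (SLERP) and of how a list-valued `t` such as `[0, 0.5, 0.3, 0.7, 1]` might be spread across 18 layers. It is not mergekit's implementation: the function names `slerp` and `gradient_value` are hypothetical, and the piecewise-linear spacing of the anchor values over layer depth is an assumption made for illustration. The convention that `t = 0` yields the base model and `t = 1` yields the other model is also assumed here.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    t = 0 returns v0 and t = 1 returns v1 (up to rounding).
    Falls back to plain linear interpolation when either vector is
    near zero or the two vectors are nearly colinear.
    """
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    n0 = math.sqrt(sum(x * x for x in v0))
    n1 = math.sqrt(sum(x * x for x in v1))
    if n0 < eps or n1 < eps:
        return lerp
    # Cosine of the angle between the two directions, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / (n0 * n1)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        return lerp
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def gradient_value(anchors, layer_idx, num_layers):
    """Assumed schedule: anchors are evenly spaced over layer depth and
    linearly interpolated in between (hypothetical helper)."""
    if num_layers == 1:
        return anchors[0]
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


if __name__ == "__main__":
    self_attn_t = [0, 0.5, 0.3, 0.7, 1]
    for layer in range(18):
        t = gradient_value(self_attn_t, layer, 18)
        print(layer, round(t, 3), slerp(t, [1.0, 0.0], [0.0, 1.0]))
```

Under these assumptions, the `self_attn` schedule starts at the base model's weights in the first layer and ends at the other model's weights in the last, while the `mlp` schedule runs the opposite way; tensors matching neither filter use the constant `t = 0.5`.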