mllm-dev commited on
Commit
1567d2f
·
verified ·
1 Parent(s): 34c539a

Upload folder using huggingface_hub

Browse files
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  base_model:
3
- - mllm-dev/gpt2_f_experiment_4
4
  - mllm-dev/gpt2_f_experiment_1
 
5
  - mllm-dev/gpt2_f_experiment_2
6
  - mllm-dev/gpt2_f_experiment_0
7
  - mllm-dev/gpt2_f_experiment_3
@@ -18,13 +18,13 @@ This is a merge of pre-trained language models created using [mergekit](https://
18
  ## Merge Details
19
  ### Merge Method
20
 
21
- This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method using [mllm-dev/gpt2_f_experiment_0](https://huggingface.co/mllm-dev/gpt2_f_experiment_0) as a base.
22
 
23
  ### Models Merged
24
 
25
  The following models were included in the merge:
26
- * [mllm-dev/gpt2_f_experiment_4](https://huggingface.co/mllm-dev/gpt2_f_experiment_4)
27
  * [mllm-dev/gpt2_f_experiment_1](https://huggingface.co/mllm-dev/gpt2_f_experiment_1)
 
28
  * [mllm-dev/gpt2_f_experiment_2](https://huggingface.co/mllm-dev/gpt2_f_experiment_2)
29
  * [mllm-dev/gpt2_f_experiment_3](https://huggingface.co/mllm-dev/gpt2_f_experiment_3)
30
 
@@ -37,7 +37,7 @@ base_model:
37
  model:
38
  path: mllm-dev/gpt2_f_experiment_0
39
  dtype: float16
40
- merge_method: linear
41
  slices:
42
  - sources:
43
  - layer_range: [0, 12]
 
1
  ---
2
  base_model:
 
3
  - mllm-dev/gpt2_f_experiment_1
4
+ - mllm-dev/gpt2_f_experiment_4
5
  - mllm-dev/gpt2_f_experiment_2
6
  - mllm-dev/gpt2_f_experiment_0
7
  - mllm-dev/gpt2_f_experiment_3
 
18
  ## Merge Details
19
  ### Merge Method
20
 
21
+ This model was merged using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method using [mllm-dev/gpt2_f_experiment_0](https://huggingface.co/mllm-dev/gpt2_f_experiment_0) as a base.
22
 
23
  ### Models Merged
24
 
25
  The following models were included in the merge:
 
26
  * [mllm-dev/gpt2_f_experiment_1](https://huggingface.co/mllm-dev/gpt2_f_experiment_1)
27
+ * [mllm-dev/gpt2_f_experiment_4](https://huggingface.co/mllm-dev/gpt2_f_experiment_4)
28
  * [mllm-dev/gpt2_f_experiment_2](https://huggingface.co/mllm-dev/gpt2_f_experiment_2)
29
  * [mllm-dev/gpt2_f_experiment_3](https://huggingface.co/mllm-dev/gpt2_f_experiment_3)
30
 
 
37
  model:
38
  path: mllm-dev/gpt2_f_experiment_0
39
  dtype: float16
40
+ merge_method: dare_linear
41
  slices:
42
  - sources:
43
  - layer_range: [0, 12]
mergekit_config.yml CHANGED
@@ -2,7 +2,7 @@ base_model:
2
  model:
3
  path: mllm-dev/gpt2_f_experiment_0
4
  dtype: float16
5
- merge_method: linear
6
  slices:
7
  - sources:
8
  - layer_range: [0, 12]
 
2
  model:
3
  path: mllm-dev/gpt2_f_experiment_0
4
  dtype: float16
5
+ merge_method: dare_linear
6
  slices:
7
  - sources:
8
  - layer_range: [0, 12]
model-00001-of-00001.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:109d22198c42220534f2b55ff9566334f14c2d3c6976f90d83b3d654b92dbc74
3
  size 248902264
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aaf8b491f91cfe5f946b0c2df0007e4d670fffa62d27239b1f5d52ffed7b7a2f
3
  size 248902264