Improve language tag

#1
by lbourdois - opened
Files changed (1) hide show
  1. README.md +59 -46
README.md CHANGED
@@ -1,46 +1,59 @@
1
- ---
2
- base_model:
3
- - Qwen/Qwen2.5-0.5B-Instruct
4
- library_name: transformers
5
- tags:
6
- - mergekit
7
- - merge
8
-
9
- ---
10
- # merge
11
-
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
-
14
- ## Merge Details
15
- ### Merge Method
16
-
17
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) as a base.
18
-
19
- ### Models Merged
20
-
21
- The following models were included in the merge:
22
-
23
-
24
- ### Configuration
25
-
26
- The following YAML configuration was used to produce this model:
27
-
28
- ```yaml
29
- slices:
30
- - sources:
31
- - model: Qwen/Qwen2.5-0.5B-Instruct
32
- layer_range: [0, 11]
33
- - model: Qwen/Qwen2.5-0.5B-Instruct
34
- layer_range: [12, 23]
35
- merge_method: ties
36
- base_model: Qwen/Qwen2.5-0.5B-Instruct
37
- parameters:
38
- t:
39
- - filter: self_attn
40
- value: [0.1, 0.3, 0.5, 0.7, 0.9]
41
- - filter: mlp
42
- value: [0.1, 0.3, 0.5, 0.7, 0.9]
43
- - value: 0.5
44
- dtype: bfloat16
45
-
46
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-0.5B-Instruct
4
+ library_name: transformers
5
+ tags:
6
+ - mergekit
7
+ - merge
8
+ language:
9
+ - zho
10
+ - eng
11
+ - fra
12
+ - spa
13
+ - por
14
+ - deu
15
+ - ita
16
+ - rus
17
+ - jpn
18
+ - kor
19
+ - vie
20
+ - tha
21
+ - ara
22
+ ---
23
+ # merge
24
+
25
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
26
+
27
+ ## Merge Details
28
+ ### Merge Method
29
+
30
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) as a base.
31
+
32
+ ### Models Merged
33
+
34
+ The following models were included in the merge:
35
+
36
+
37
+ ### Configuration
38
+
39
+ The following YAML configuration was used to produce this model:
40
+
41
+ ```yaml
42
+ slices:
43
+ - sources:
44
+ - model: Qwen/Qwen2.5-0.5B-Instruct
45
+ layer_range: [0, 11]
46
+ - model: Qwen/Qwen2.5-0.5B-Instruct
47
+ layer_range: [12, 23]
48
+ merge_method: ties
49
+ base_model: Qwen/Qwen2.5-0.5B-Instruct
50
+ parameters:
51
+ t:
52
+ - filter: self_attn
53
+ value: [0.1, 0.3, 0.5, 0.7, 0.9]
54
+ - filter: mlp
55
+ value: [0.1, 0.3, 0.5, 0.7, 0.9]
56
+ - value: 0.5
57
+ dtype: bfloat16
58
+
59
+ ```