Improve language tag

#1
by lbourdois - opened
Files changed (1) hide show
  1. README.md +124 -111
README.md CHANGED
@@ -1,111 +1,124 @@
1
- ---
2
- base_model:
3
- - Qwen/Qwen2.5-1.5B-Instruct
4
- library_name: transformers
5
- tags:
6
- - mergekit
7
- - merge
8
-
9
- ---
10
- # merge
11
-
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
-
14
- ## Merge Details
15
- ### Merge Method
16
-
17
- This model was merged using the passthrough merge method.
18
-
19
- ### Models Merged
20
-
21
- The following models were included in the merge:
22
- * [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct)
23
-
24
- ### Configuration
25
-
26
- The following YAML configuration was used to produce this model:
27
-
28
- ```yaml
29
- dtype: bfloat16
30
- merge_method: passthrough
31
- slices:
32
- - sources:
33
- - layer_range: [0, 2]
34
- model: Qwen/Qwen2.5-1.5B-Instruct
35
- - sources:
36
- - layer_range: [1, 3]
37
- model: Qwen/Qwen2.5-1.5B-Instruct
38
- - sources:
39
- - layer_range: [2, 4]
40
- model: Qwen/Qwen2.5-1.5B-Instruct
41
- - sources:
42
- - layer_range: [3, 5]
43
- model: Qwen/Qwen2.5-1.5B-Instruct
44
- - sources:
45
- - layer_range: [4, 6]
46
- model: Qwen/Qwen2.5-1.5B-Instruct
47
- - sources:
48
- - layer_range: [5, 7]
49
- model: Qwen/Qwen2.5-1.5B-Instruct
50
- - sources:
51
- - layer_range: [6, 8]
52
- model: Qwen/Qwen2.5-1.5B-Instruct
53
- - sources:
54
- - layer_range: [7, 9]
55
- model: Qwen/Qwen2.5-1.5B-Instruct
56
- - sources:
57
- - layer_range: [8, 10]
58
- model: Qwen/Qwen2.5-1.5B-Instruct
59
- - sources:
60
- - layer_range: [9, 11]
61
- model: Qwen/Qwen2.5-1.5B-Instruct
62
- - sources:
63
- - layer_range: [10, 12]
64
- model: Qwen/Qwen2.5-1.5B-Instruct
65
- - sources:
66
- - layer_range: [11, 13]
67
- model: Qwen/Qwen2.5-1.5B-Instruct
68
- - sources:
69
- - layer_range: [12, 14]
70
- model: Qwen/Qwen2.5-1.5B-Instruct
71
- - sources:
72
- - layer_range: [13, 15]
73
- model: Qwen/Qwen2.5-1.5B-Instruct
74
- - sources:
75
- - layer_range: [14, 16]
76
- model: Qwen/Qwen2.5-1.5B-Instruct
77
- - sources:
78
- - layer_range: [16, 18]
79
- model: Qwen/Qwen2.5-1.5B-Instruct
80
- - sources:
81
- - layer_range: [17, 19]
82
- model: Qwen/Qwen2.5-1.5B-Instruct
83
- - sources:
84
- - layer_range: [18, 20]
85
- model: Qwen/Qwen2.5-1.5B-Instruct
86
- - sources:
87
- - layer_range: [19, 21]
88
- model: Qwen/Qwen2.5-1.5B-Instruct
89
- - sources:
90
- - layer_range: [20, 22]
91
- model: Qwen/Qwen2.5-1.5B-Instruct
92
- - sources:
93
- - layer_range: [21, 23]
94
- model: Qwen/Qwen2.5-1.5B-Instruct
95
- - sources:
96
- - layer_range: [22, 24]
97
- model: Qwen/Qwen2.5-1.5B-Instruct
98
- - sources:
99
- - layer_range: [23, 25]
100
- model: Qwen/Qwen2.5-1.5B-Instruct
101
- - sources:
102
- - layer_range: [24, 26]
103
- model: Qwen/Qwen2.5-1.5B-Instruct
104
- - sources:
105
- - layer_range: [25, 27]
106
- model: Qwen/Qwen2.5-1.5B-Instruct
107
- - sources:
108
- - layer_range: [26, 28]
109
- model: Qwen/Qwen2.5-1.5B-Instruct
110
-
111
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-1.5B-Instruct
4
+ library_name: transformers
5
+ tags:
6
+ - mergekit
7
+ - merge
8
+ language:
9
+ - zho
10
+ - eng
11
+ - fra
12
+ - spa
13
+ - por
14
+ - deu
15
+ - ita
16
+ - rus
17
+ - jpn
18
+ - kor
19
+ - vie
20
+ - tha
21
+ - ara
22
+ ---
23
+ # merge
24
+
25
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
26
+
27
+ ## Merge Details
28
+ ### Merge Method
29
+
30
+ This model was merged using the passthrough merge method.
31
+
32
+ ### Models Merged
33
+
34
+ The following models were included in the merge:
35
+ * [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct)
36
+
37
+ ### Configuration
38
+
39
+ The following YAML configuration was used to produce this model:
40
+
41
+ ```yaml
42
+ dtype: bfloat16
43
+ merge_method: passthrough
44
+ slices:
45
+ - sources:
46
+ - layer_range: [0, 2]
47
+ model: Qwen/Qwen2.5-1.5B-Instruct
48
+ - sources:
49
+ - layer_range: [1, 3]
50
+ model: Qwen/Qwen2.5-1.5B-Instruct
51
+ - sources:
52
+ - layer_range: [2, 4]
53
+ model: Qwen/Qwen2.5-1.5B-Instruct
54
+ - sources:
55
+ - layer_range: [3, 5]
56
+ model: Qwen/Qwen2.5-1.5B-Instruct
57
+ - sources:
58
+ - layer_range: [4, 6]
59
+ model: Qwen/Qwen2.5-1.5B-Instruct
60
+ - sources:
61
+ - layer_range: [5, 7]
62
+ model: Qwen/Qwen2.5-1.5B-Instruct
63
+ - sources:
64
+ - layer_range: [6, 8]
65
+ model: Qwen/Qwen2.5-1.5B-Instruct
66
+ - sources:
67
+ - layer_range: [7, 9]
68
+ model: Qwen/Qwen2.5-1.5B-Instruct
69
+ - sources:
70
+ - layer_range: [8, 10]
71
+ model: Qwen/Qwen2.5-1.5B-Instruct
72
+ - sources:
73
+ - layer_range: [9, 11]
74
+ model: Qwen/Qwen2.5-1.5B-Instruct
75
+ - sources:
76
+ - layer_range: [10, 12]
77
+ model: Qwen/Qwen2.5-1.5B-Instruct
78
+ - sources:
79
+ - layer_range: [11, 13]
80
+ model: Qwen/Qwen2.5-1.5B-Instruct
81
+ - sources:
82
+ - layer_range: [12, 14]
83
+ model: Qwen/Qwen2.5-1.5B-Instruct
84
+ - sources:
85
+ - layer_range: [13, 15]
86
+ model: Qwen/Qwen2.5-1.5B-Instruct
87
+ - sources:
88
+ - layer_range: [14, 16]
89
+ model: Qwen/Qwen2.5-1.5B-Instruct
90
+ - sources:
91
+ - layer_range: [16, 18]
92
+ model: Qwen/Qwen2.5-1.5B-Instruct
93
+ - sources:
94
+ - layer_range: [17, 19]
95
+ model: Qwen/Qwen2.5-1.5B-Instruct
96
+ - sources:
97
+ - layer_range: [18, 20]
98
+ model: Qwen/Qwen2.5-1.5B-Instruct
99
+ - sources:
100
+ - layer_range: [19, 21]
101
+ model: Qwen/Qwen2.5-1.5B-Instruct
102
+ - sources:
103
+ - layer_range: [20, 22]
104
+ model: Qwen/Qwen2.5-1.5B-Instruct
105
+ - sources:
106
+ - layer_range: [21, 23]
107
+ model: Qwen/Qwen2.5-1.5B-Instruct
108
+ - sources:
109
+ - layer_range: [22, 24]
110
+ model: Qwen/Qwen2.5-1.5B-Instruct
111
+ - sources:
112
+ - layer_range: [23, 25]
113
+ model: Qwen/Qwen2.5-1.5B-Instruct
114
+ - sources:
115
+ - layer_range: [24, 26]
116
+ model: Qwen/Qwen2.5-1.5B-Instruct
117
+ - sources:
118
+ - layer_range: [25, 27]
119
+ model: Qwen/Qwen2.5-1.5B-Instruct
120
+ - sources:
121
+ - layer_range: [26, 28]
122
+ model: Qwen/Qwen2.5-1.5B-Instruct
123
+
124
+ ```