lbourdois commited on
Commit
9765536
·
verified ·
1 Parent(s): 90aecec

Improve language tag

Browse files

Hi! As the model is multilingual, this is a PR to add other languages than English to the language tag to improve the referencing. Note that 29 languages are announced in the README, but only 13 are explicitly listed. I was therefore only able to add these 13 languages.

Files changed (1) hide show
  1. README.md +66 -66
README.md CHANGED
@@ -1,66 +1,66 @@
1
- ---
2
- base_model:
3
- - Qwen/Qwen2.5-Coder-7B-Instruct
4
- - Qwen/Qwen2.5-Math-7B-Instruct
5
- - Qwen/Qwen2.5-7B
6
- - Qwen/Qwen2.5-7B-Instruct
7
- library_name: transformers
8
- tags:
9
- - mergekit
10
- - merge
11
- language:
12
- - zho
13
- - eng
14
- - fra
15
- - spa
16
- - por
17
- - deu
18
- - ita
19
- - rus
20
- - jpn
21
- - kor
22
- - vie
23
- - tha
24
- - ara
25
- ---
26
- # merge
27
-
28
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). Requires fine-tuning.
29
-
30
- ## Merge Details
31
- ### Merge Method
32
-
33
- This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B) as a base.
34
-
35
- ### Models Merged
36
-
37
- The following models were included in the merge:
38
- * [Qwen/Qwen2.5-Coder-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)
39
- * [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct)
40
- * [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
41
-
42
- ### Configuration
43
-
44
- The following YAML configuration was used to produce this model:
45
-
46
- ```yaml
47
- models:
48
- - model: Qwen/Qwen2.5-7B
49
- # No parameters necessary for base model
50
- - model: Qwen/Qwen2.5-7B-Instruct
51
- parameters:
52
- density: 0.6
53
- weight: 0.62
54
- - model: Qwen/Qwen2.5-Coder-7B-Instruct
55
- parameters:
56
- density: 0.56
57
- weight: 0.58
58
- - model: Qwen/Qwen2.5-Math-7B-Instruct
59
- parameters:
60
- density: 0.6
61
- weight: 0.56
62
- merge_method: dare_ties
63
- base_model: Qwen/Qwen2.5-7B
64
- dtype: bfloat16
65
-
66
- ```
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-Coder-7B-Instruct
4
+ - Qwen/Qwen2.5-Math-7B-Instruct
5
+ - Qwen/Qwen2.5-7B
6
+ - Qwen/Qwen2.5-7B-Instruct
7
+ library_name: transformers
8
+ tags:
9
+ - mergekit
10
+ - merge
11
+ language:
12
+ - zho
13
+ - eng
14
+ - fra
15
+ - spa
16
+ - por
17
+ - deu
18
+ - ita
19
+ - rus
20
+ - jpn
21
+ - kor
22
+ - vie
23
+ - tha
24
+ - ara
25
+ ---
26
+ # merge
27
+
28
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). Requires fine-tuning.
29
+
30
+ ## Merge Details
31
+ ### Merge Method
32
+
33
+ This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B) as a base.
34
+
35
+ ### Models Merged
36
+
37
+ The following models were included in the merge:
38
+ * [Qwen/Qwen2.5-Coder-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)
39
+ * [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct)
40
+ * [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
41
+
42
+ ### Configuration
43
+
44
+ The following YAML configuration was used to produce this model:
45
+
46
+ ```yaml
47
+ models:
48
+ - model: Qwen/Qwen2.5-7B
49
+ # No parameters necessary for base model
50
+ - model: Qwen/Qwen2.5-7B-Instruct
51
+ parameters:
52
+ density: 0.6
53
+ weight: 0.62
54
+ - model: Qwen/Qwen2.5-Coder-7B-Instruct
55
+ parameters:
56
+ density: 0.56
57
+ weight: 0.58
58
+ - model: Qwen/Qwen2.5-Math-7B-Instruct
59
+ parameters:
60
+ density: 0.6
61
+ weight: 0.56
62
+ merge_method: dare_ties
63
+ base_model: Qwen/Qwen2.5-7B
64
+ dtype: bfloat16
65
+
66
+ ```