Improve language tag

#1
by lbourdois - opened
Files changed (1)
  1. README.md +56 -42
README.md CHANGED
@@ -1,42 +1,56 @@
- ---
- base_model:
- - prithivMLmods/Blaze.1-32B-Instruct
- - Qwen/Qwen2.5-32B
- - Qwen/QwQ-32B-Preview
- library_name: transformers
- tags:
- - mergekit
- - merge
- ---
- ![omni.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/zHI_YD5JjDHEEjZ3ubUrE.png)
-
- # **Merge QwQ Based**
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview) as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [prithivMLmods/Blaze.1-32B-Instruct](https://huggingface.co/prithivMLmods/Blaze.1-32B-Instruct)
- * [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- models:
- - model: prithivMLmods/Blaze.1-32B-Instruct
- - model: Qwen/Qwen2.5-32B
- - model: Qwen/QwQ-32B-Preview
- merge_method: model_stock
- base_model: Qwen/QwQ-32B-Preview
- normalize: true
- int8_mask: true
- dtype: bfloat16
- ```
+ ---
+ base_model:
+ - prithivMLmods/Blaze.1-32B-Instruct
+ - Qwen/Qwen2.5-32B
+ - Qwen/QwQ-32B-Preview
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+ ![omni.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/zHI_YD5JjDHEEjZ3ubUrE.png)
+
+ # **Merge QwQ Based**
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview) as a base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [prithivMLmods/Blaze.1-32B-Instruct](https://huggingface.co/prithivMLmods/Blaze.1-32B-Instruct)
+ * [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+ - model: prithivMLmods/Blaze.1-32B-Instruct
+ - model: Qwen/Qwen2.5-32B
+ - model: Qwen/QwQ-32B-Preview
+ merge_method: model_stock
+ base_model: Qwen/QwQ-32B-Preview
+ normalize: true
+ int8_mask: true
+ dtype: bfloat16
+ ```
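
The only substantive change on the `+` side is the new `language:` list in the README front matter. As a minimal sketch of how a reviewer might sanity-check that list, the snippet below pulls the entries out of the front matter and confirms each one is a three-letter lowercase code, the form used in this PR. It assumes list items are written as `- value` directly under their key, as they are here. `extract_list` and `is_three_letter_code` are hypothetical helpers written for this illustration, not part of any Hugging Face library, and the hand-rolled line scan stands in for a real YAML parser.

```python
import re

# Front matter copied from the "+" side of the diff.
FRONT_MATTER = """\
---
base_model:
- prithivMLmods/Blaze.1-32B-Instruct
- Qwen/Qwen2.5-32B
- Qwen/QwQ-32B-Preview
library_name: transformers
tags:
- mergekit
- merge
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---
"""


def extract_list(front_matter: str, key: str) -> list[str]:
    """Return the `- item` entries listed directly under `key:` (hypothetical helper)."""
    items = []
    in_key = False
    for raw in front_matter.splitlines():
        line = raw.strip()
        if line == "---" or not line:
            # Delimiters and blank lines end any list being collected.
            in_key = False
            continue
        if line.startswith("- "):
            if in_key:
                items.append(line[2:].strip())
            continue
        # Any other line starts a new `key:` or `key: value` entry.
        in_key = line == f"{key}:"
    return items


def is_three_letter_code(code: str) -> bool:
    """Check the shape only: exactly three lowercase ASCII letters."""
    return re.fullmatch(r"[a-z]{3}", code) is not None


languages = extract_list(FRONT_MATTER, "language")
invalid = [code for code in languages if not is_three_letter_code(code)]
print(len(languages), invalid)  # prints: 13 []
```

This checks the shape of each tag only. It does not confirm that a tag is a registered language code, which would need a lookup against an actual code table.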