Improve language tag

#1
by lbourdois - opened
Files changed (1) hide show
  1. README.md +64 -51
README.md CHANGED
@@ -1,51 +1,64 @@
1
- ---
2
- base_model:
3
- - bespokelabs/Bespoke-Stratos-32B
4
- - NovaSky-AI/Sky-T1-32B-Preview
5
- - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
6
- - Qwen/Qwen2.5-32B
7
- - Qwen/QwQ-32B-Preview
8
- library_name: transformers
9
- tags:
10
- - mergekit
11
- - merge
12
-
13
- ---
14
- # model_32bv2
15
-
16
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
17
-
18
- ## Merge Details
19
- ### Merge Method
20
-
21
- This model was merged using the sce merge method using [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B) as a base.
22
-
23
- ### Models Merged
24
-
25
- The following models were included in the merge:
26
- * [bespokelabs/Bespoke-Stratos-32B](https://huggingface.co/bespokelabs/Bespoke-Stratos-32B)
27
- * [NovaSky-AI/Sky-T1-32B-Preview](https://huggingface.co/NovaSky-AI/Sky-T1-32B-Preview)
28
- * [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B)
29
- * [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview)
30
-
31
- ### Configuration
32
-
33
- The following YAML configuration was used to produce this model:
34
-
35
- ```yaml
36
- models:
37
- # Pivot model
38
- - model: Qwen/Qwen2.5-32B
39
- # Target models
40
- - model: Qwen/QwQ-32B-Preview
41
- - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
42
- - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
43
- - model: bespokelabs/Bespoke-Stratos-32B
44
- - model: NovaSky-AI/Sky-T1-32B-Preview
45
- merge_method: sce
46
- base_model: Qwen/Qwen2.5-32B
47
- parameters:
48
- select_topk: 1.0
49
- dtype: bfloat16
50
-
51
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - bespokelabs/Bespoke-Stratos-32B
4
+ - NovaSky-AI/Sky-T1-32B-Preview
5
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
6
+ - Qwen/Qwen2.5-32B
7
+ - Qwen/QwQ-32B-Preview
8
+ library_name: transformers
9
+ tags:
10
+ - mergekit
11
+ - merge
12
+ language:
13
+ - zho
14
+ - eng
15
+ - fra
16
+ - spa
17
+ - por
18
+ - deu
19
+ - ita
20
+ - rus
21
+ - jpn
22
+ - kor
23
+ - vie
24
+ - tha
25
+ - ara
26
+ ---
27
+ # model_32bv2
28
+
29
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
30
+
31
+ ## Merge Details
32
+ ### Merge Method
33
+
34
+ This model was merged using the sce merge method using [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B) as a base.
35
+
36
+ ### Models Merged
37
+
38
+ The following models were included in the merge:
39
+ * [bespokelabs/Bespoke-Stratos-32B](https://huggingface.co/bespokelabs/Bespoke-Stratos-32B)
40
+ * [NovaSky-AI/Sky-T1-32B-Preview](https://huggingface.co/NovaSky-AI/Sky-T1-32B-Preview)
41
+ * [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B)
42
+ * [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview)
43
+
44
+ ### Configuration
45
+
46
+ The following YAML configuration was used to produce this model:
47
+
48
+ ```yaml
49
+ models:
50
+ # Pivot model
51
+ - model: Qwen/Qwen2.5-32B
52
+ # Target models
53
+ - model: Qwen/QwQ-32B-Preview
54
+ - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
55
+ - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
56
+ - model: bespokelabs/Bespoke-Stratos-32B
57
+ - model: NovaSky-AI/Sky-T1-32B-Preview
58
+ merge_method: sce
59
+ base_model: Qwen/Qwen2.5-32B
60
+ parameters:
61
+ select_topk: 1.0
62
+ dtype: bfloat16
63
+
64
+ ```