PyTorch
qwen2

Improve language tag

#3
by lbourdois - opened
Files changed (1)
  1. README.md +42 -28
README.md CHANGED
@@ -1,29 +1,43 @@
- ---
- license: apache-2.0
- datasets:
- - PrimeIntellect/SYNTHETIC-1-SFT-Data
- base_model:
- - Qwen/Qwen2.5-7B-Instruct
- ---
- # SYNTHETIC-1-7B-SFT
-
- SYNTHETIC-1-7B-SFT is an initial model trained on the SFT subset of SYNTHETIC-1, a collaboratively generated reasoning dataset from Deepseek-R1. The model largely outperforms other models based on Qwen-2.5-Instruct-7B that were trained with smaller reasoning datasets.
-
- All SYNTHETIC-1 datasets can be found in our [🤗 SYNTHETIC-1 Collection](https://huggingface.co/collections/PrimeIntellect/synthetic-1-67a2c399cfdd6c9f7fae0c37).
-
-
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/Z72xymkSvMn2yNO0w2lug.png)
-
-
- ## Citation
-
- Feel free to cite SYNTHETIC-1 if you have found it useful for your work
-
- ```bib
- @misc{2025synthetic1,
- title={SYNTHETIC-1: Two Million Collaboratively Generated Reasoning Traces from Deepseek-R1},
- author={Justus Mattern and Sami Jaghouar and Manveer Basra and Jannik Straube and Matthew Di Ferrante and Felix Gabriel and Jack Min Ong and Vincent Weisser and Johannes Hagemann},
- year={2025},
- url={https://www.primeintellect.ai/blog/synthetic-1-release},
- }
+ ---
+ license: apache-2.0
+ datasets:
+ - PrimeIntellect/SYNTHETIC-1-SFT-Data
+ base_model:
+ - Qwen/Qwen2.5-7B-Instruct
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+ # SYNTHETIC-1-7B-SFT
+
+ SYNTHETIC-1-7B-SFT is an initial model trained on the SFT subset of SYNTHETIC-1, a collaboratively generated reasoning dataset from Deepseek-R1. The model largely outperforms other models based on Qwen-2.5-Instruct-7B that were trained with smaller reasoning datasets.
+
+ All SYNTHETIC-1 datasets can be found in our [🤗 SYNTHETIC-1 Collection](https://huggingface.co/collections/PrimeIntellect/synthetic-1-67a2c399cfdd6c9f7fae0c37).
+
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/Z72xymkSvMn2yNO0w2lug.png)
+
+
+ ## Citation
+
+ Feel free to cite SYNTHETIC-1 if you have found it useful for your work
+
+ ```bib
+ @misc{2025synthetic1,
+ title={SYNTHETIC-1: Two Million Collaboratively Generated Reasoning Traces from Deepseek-R1},
+ author={Justus Mattern and Sami Jaghouar and Manveer Basra and Jannik Straube and Matthew Di Ferrante and Felix Gabriel and Jack Min Ong and Vincent Weisser and Johannes Hagemann},
+ year={2025},
+ url={https://www.primeintellect.ai/blog/synthetic-1-release},
+ }
  ```
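
The only substantive change in this diff is the new `language:` list in the README front matter. As a rough illustration of how that list could be read back out of a README, here is a minimal Python sketch. The function name `front_matter_languages` is hypothetical, and the parsing is a simplification that assumes the block-style list layout shown in the diff; it is not a general YAML parser and not part of any Hugging Face tooling.

```python
def front_matter_languages(readme_text):
    """Return the entries listed under `language:` in a README's front matter.

    Simplified sketch: assumes the front matter is delimited by `---` lines
    and that `language:` is followed by block-style `- code` items, as in
    the diff above. Anything else yields an empty list.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no front matter at the top of the file

    languages = []
    in_language = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing delimiter of the front matter
        if in_language and stripped.startswith("- "):
            languages.append(stripped[2:].strip())
        else:
            # Any other line ends the list; a `language:` key starts one.
            in_language = stripped == "language:"
    return languages


# Shortened example modelled on the front matter in this PR.
example = """---
license: apache-2.0
base_model:
- Qwen/Qwen2.5-7B-Instruct
language:
- zho
- eng
---
# SYNTHETIC-1-7B-SFT
"""

print(front_matter_languages(example))  # prints ['zho', 'eng']
```

For real use, a proper YAML parser would be the safer choice, since front matter can also use inline lists or quoted values that this sketch does not handle.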