PrimeIntellect/SYNTHETIC-1-SFT-7B


Improve language tag #3

by lbourdois - opened Apr 28, 2025

base: refs/heads/main ← from: refs/pr/3


Files changed (1)

README.md +42 -28


@@ -1,29 +1,43 @@
----
-license: apache-2.0
-datasets:
-- PrimeIntellect/SYNTHETIC-1-SFT-Data
-base_model:
-- Qwen/Qwen2.5-7B-Instruct
----
-# SYNTHETIC-1-7B-SFT
-SYNTHETIC-1-7B-SFT is an initial model trained on the SFT subset of SYNTHETIC-1, a collaboratively generated reasoning dataset from Deepseek-R1. The model largely outperforms other models based on Qwen-2.5-Instruct-7B that were trained with smaller reasoning datasets.
-All SYNTHETIC-1 datasets can be found in our [🤗 SYNTHETIC-1 Collection](https://huggingface.co/collections/PrimeIntellect/synthetic-1-67a2c399cfdd6c9f7fae0c37).
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/Z72xymkSvMn2yNO0w2lug.png)
-## Citation
-Feel free to cite SYNTHETIC-1 if you have found it useful for your work
-```bib
-@misc{2025synthetic1,
-      title={SYNTHETIC-1: Two Million Collaboratively Generated Reasoning Traces from Deepseek-R1},
-      author={Justus Mattern and Sami Jaghouar and Manveer Basra and Jannik Straube and Matthew Di Ferrante and Felix Gabriel and Jack Min Ong and Vincent Weisser and Johannes Hagemann},
-      year={2025},
-      url={https://www.primeintellect.ai/blog/synthetic-1-release},
-}
 ```

+---
+license: apache-2.0
+datasets:
+- PrimeIntellect/SYNTHETIC-1-SFT-Data
+base_model:
+- Qwen/Qwen2.5-7B-Instruct
+language:
+- zho
+- eng
+- fra
+- spa
+- por
+- deu
+- ita
+- rus
+- jpn
+- kor
+- vie
+- tha
+- ara
+---
+# SYNTHETIC-1-7B-SFT
+SYNTHETIC-1-7B-SFT is an initial model trained on the SFT subset of SYNTHETIC-1, a collaboratively generated reasoning dataset from Deepseek-R1. The model largely outperforms other models based on Qwen-2.5-Instruct-7B that were trained with smaller reasoning datasets.
+All SYNTHETIC-1 datasets can be found in our [🤗 SYNTHETIC-1 Collection](https://huggingface.co/collections/PrimeIntellect/synthetic-1-67a2c399cfdd6c9f7fae0c37).
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/Z72xymkSvMn2yNO0w2lug.png)
+## Citation
+Feel free to cite SYNTHETIC-1 if you have found it useful for your work
+```bib
+@misc{2025synthetic1,
+      title={SYNTHETIC-1: Two Million Collaboratively Generated Reasoning Traces from Deepseek-R1},
+      author={Justus Mattern and Sami Jaghouar and Manveer Basra and Jannik Straube and Matthew Di Ferrante and Felix Gabriel and Jack Min Ong and Vincent Weisser and Johannes Hagemann},
+      year={2025},
+      url={https://www.primeintellect.ai/blog/synthetic-1-release},
+}
 ```
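
The substantive change in this diff is the new `language:` list in the YAML front matter; the rest of the README is unchanged. As a minimal sketch of how a reviewer might sanity-check that list, the snippet below pulls the entries out of the front matter and flags any that are not three lowercase letters. The function name `front_matter_languages` and the `README_HEAD` sample are hypothetical, introduced only for illustration. The parser is a deliberately small stdlib-only reader for this flat layout, not a general YAML parser, and the check covers the shape of each tag only, not whether the code is registered in ISO 639.

```python
def front_matter_languages(readme_text):
    """Return the entries listed under `language:` in the YAML front matter.

    Hypothetical helper: handles only the flat `key:` / `- item` layout
    used in this README, not arbitrary YAML.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no front matter block at the top of the file
    languages = []
    in_language_key = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing delimiter of the front matter
        if in_language_key and stripped.startswith("- "):
            languages.append(stripped[2:].strip())
        else:
            # Any other line ends the current list; track whether a new
            # `language:` key starts here.
            in_language_key = stripped == "language:"
    return languages


# Front matter as it appears on the "+" side of the diff above.
README_HEAD = """---
license: apache-2.0
datasets:
- PrimeIntellect/SYNTHETIC-1-SFT-Data
base_model:
- Qwen/Qwen2.5-7B-Instruct
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---
# SYNTHETIC-1-7B-SFT
"""

codes = front_matter_languages(README_HEAD)
# Shape check only: three lowercase ASCII letters per tag.
malformed = [c for c in codes if not (len(c) == 3 and c.isascii() and c.isalpha() and c.islower())]
print(len(codes), "language tags; malformed:", malformed)
```

Running the same function on the "-" side of the diff would return an empty list, since the original front matter has no `language:` key.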