2stacks committed on
Commit 32960c2 · verified · 1 Parent(s): 82540f0

Update README.md

Files changed (1)
  1. README.md +50 -38
README.md CHANGED
@@ -1,38 +1,50 @@
- ---
- pipeline_tag: text-generation
- inference: true
- license: apache-2.0
- datasets:
- - simplescaling/s1K-1.1
- base_model:
- - Qwen/Qwen2.5-1.5B-Instruct
- library_name: transformers
- language:
- - zho
- - eng
- - fra
- - spa
- - por
- - deu
- - ita
- - rus
- - jpn
- - kor
- - vie
- - tha
- - ara
- ---
-
- # Model Summary
-
- > s1.1-1.5B is a successor to [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer-grade GPUs.
-
- - **Logs:** https://wandb.ai/2stacks-sms/s1/runs/bu2ztl7d
- - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
- - **Paper:** https://arxiv.org/abs/2501.19393
-
- Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
-
- # Use
-
- The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
+ ---
+ license: apache-2.0
+ datasets:
+ - simplescaling/s1K-1.1
+ language:
+ - en
+ - fr
+ - zh
+ - es
+ - pt
+ - de
+ - it
+ - ru
+ - ja
+ - ko
+ - vi
+ - th
+ - ar
+ base_model:
+ - Qwen/Qwen2.5-1.5B-Instruct
+ pipeline_tag: text-generation
+ library_name: transformers
+ ---
+
+
+ # Model Summary
+
+
+
+ > s1.1-1.5B is a successor to [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer-grade GPUs.
+
+
+
+ - **Logs:** https://wandb.ai/2stacks-sms/s1/runs/bu2ztl7d
+
+ - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
+
+ - **Paper:** https://arxiv.org/abs/2501.19393
+
+
+
+ Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
+
+
+
+ # Use
+
+
+
+ The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).