Update README.md
README.md
CHANGED
@@ -55,10 +55,10 @@ print(tok.decode(out[0], skip_special_tokens=True))
 ## Training procedure
 
 * ~512 warm-start steps (Alpaca-style data)
-* 256
-*
+* 256 Additional pretraining steps on (O1-OPEN/OpenO1-SFT)
+* 128 SFT steps with (Jackrong/gpt-oss-120b-reasoning-STEM-5K)
+* 256 SFT steps with (O1-OPEN/OpenO1-SFT)
 
-This model was trained with SFT.
 
 ### Framework versions
 
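
The four-stage schedule listed under "Training procedure" after this change can be written down as plain data. The following is only an illustrative sketch, not the author's training code: the `Stage` class, the `SCHEDULE` list, and the helpers `total_steps` and `stage_at` are hypothetical names, and the warm-start count is treated as exactly 512 even though the README gives it as approximate ("~512").

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One phase of the schedule (hypothetical structure, for illustration)."""
    name: str
    steps: int
    dataset: str


# Step counts and dataset names are copied from the README list.
# Assumption: "~512" is rounded to exactly 512 here.
SCHEDULE = [
    Stage("warm-start", 512, "Alpaca-style data"),
    Stage("additional pretraining", 256, "O1-OPEN/OpenO1-SFT"),
    Stage("sft", 128, "Jackrong/gpt-oss-120b-reasoning-STEM-5K"),
    Stage("sft", 256, "O1-OPEN/OpenO1-SFT"),
]


def total_steps(schedule):
    """Sum the step counts of all stages."""
    return sum(stage.steps for stage in schedule)


def stage_at(schedule, step):
    """Return the stage that a 0-indexed global step falls into."""
    if step < 0:
        raise IndexError("step must be non-negative")
    boundary = 0
    for stage in schedule:
        boundary += stage.steps
        if step < boundary:
            return stage
    raise IndexError("step is past the end of the schedule")


if __name__ == "__main__":
    print("total steps:", total_steps(SCHEDULE))
    for stage in SCHEDULE:
        print(f"{stage.steps:>4} steps  {stage.name}  ({stage.dataset})")
```

Under the rounding assumption above, the stages add up to 1152 steps, and `stage_at` maps a global step index back to the dataset in use at that point.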