reaperdoesntknow commited on
Commit
d04a7bc
·
verified ·
1 Parent(s): fc78f21

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -55,10 +55,10 @@ print(tok.decode(out[0], skip_special_tokens=True))
55
  ## Training procedure
56
 
57
  * ~512 warm-start steps (Alpaca-style data)
58
- * 256 SFT steps on `O1-OPEN/OpenO1-SFT`
59
- * +100 top-up SFT steps for reasoning behaviors
 
60
 
61
- This model was trained with SFT.
62
 
63
  ### Framework versions
64
 
 
55
  ## Training procedure
56
 
57
  * ~512 warm-start steps (Alpaca-style data)
58
+ * 256 Additional pretraining steps on (O1-OPEN/OpenO1-SFT)
59
+ * 128 SFT steps with (Jackrong/gpt-oss-120b-reasoning-STEM-5K)
60
+ * 256 SFT steps with (O1-OPEN/OpenO1-SFT)
61
 
 
62
 
63
  ### Framework versions
64