Nickyang committed on
Commit e6e3044 · verified · 1 Parent(s): d18c25e

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -14,6 +14,8 @@ library_name: transformers
 
 ## FastCuRL Overview
 
+### 2025-03-17
+
 We release **FastCuRL-1.5B-Preview**, a slow-thinking reasoning model that **outperforms** the previous SoTA *DeepScaleR-1.5B-Preview* with **50% training steps**! We adapt a novel curriculum-guided iterative lengthening reinforcement learning to the *DeepSeek-R1-Distill-Qwen-1.5B* and observe continuous performance improvement as training steps increase. To better reproduce our work and advance research progress, we open-source our code, model, and data.
 
 Code: https://github.com/nick7nlp/FastCuRL