Update README.md
Browse files
README.md
CHANGED
|
@@ -23,7 +23,7 @@ pipeline_tag: text-generation
|
|
| 23 |
# Information
|
| 24 |
Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
|
| 25 |
|
| 26 |
-
Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
|
| 27 |
|
| 28 |
# Format
|
| 29 |
```
|
|
|
|
| 23 |
# Information
|
| 24 |
Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
|
| 25 |
|
| 26 |
+
Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
|
| 27 |
|
| 28 |
# Format
|
| 29 |
```
|