Update README.md
README.md
CHANGED
@@ -10,4 +10,6 @@ This model was trained using PPO techniques based off of examples from Open-R1,

 The base model was Qwen2.5 3B VL, and was trained on 51526 examples and 2 epochs of pure reasoning data, most of which were coding examples.

-This model is based off of techniques and dataset formatting learned from the Andy-4 series of models as well as Smol-reason2.1
+This model is based off of techniques and dataset formatting learned from the Andy-4 series of models as well as Smol-reason2.1
+
+Charles is an acronym and stands for **"Conversational Helpful Assistant** *with* **Robust Logic** *and* **Extensible Skills"**