Sweaterdog commited on
Commit
58f889b
·
verified ·
1 Parent(s): 2cc5fd9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -10,4 +10,6 @@ This model was trained using PPO techniques based off of examples from Open-R1,
10
 
11
  The base model was Qwen2.5 3B VL, and was trained on 51526 examples and 2 epochs of pure reasoning data, most of which were coding examples.
12
 
13
- This model is based off of techniques and dataset formatting learned from the Andy-4 series of models as well as Smol-reason2.1
 
 
 
10
 
11
  The base model was Qwen2.5 3B VL, and was trained on 51526 examples and 2 epochs of pure reasoning data, most of which were coding examples.
12
 
13
+ This model is based off of techniques and dataset formatting learned from the Andy-4 series of models as well as Smol-reason2.1
14
+
15
+ Charles is an acronym and stands for **"Conversational Helpful Assistant** *with* **Robust Logic** *and* **Extensible Skills"**