--- license: apache-2.0 --- # Charles Charles is a 3B, multimodal reasoning model built for agentic coding, mathematics, and health. This model was trained using PPO techniques based off of examples from Open-R1, as well as from-scratch dataset generation from the Charles github project and Mindcraft to improve agentic tool usage. The base model was Qwen2.5 3B VL, and was trained on 51526 examples and 2 epochs of pure reasoning data, most of which were coding examples. This model is based off of techniques and dataset formatting learned from the Andy-4 series of models as well as Smol-reason2.1 Charles is an acronym and stands for: **"Conversational Helpful Assistant** *with* **Robust Logic** *and* **Extensible Skills"** I will be posting the Charles framework web app after I release the Charles LLM and prove that it works with the application well, and can outperform some larger models that aren't trained for reasoning nor agentic code use.