Commit 2b5f8f2 (verified) by kgdrathan · 1 Parent(s): df7c076

Upload folder using huggingface_hub

Files changed (1)
  1. README.md +2 -4
README.md CHANGED
@@ -105,11 +105,9 @@ And did and SFT to teach/align our SLM to the expected Marimo/Manim code style.<
 
  ## Links
 
- SFT Code:
+ SFT Code: [train/sft_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/sft_unsloth.py) and [adapter model](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/)<br>
+ ![training curves](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/resolve/main/training_curves.png)<br>
 
- [train/sft_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/sft_unsloth.py)<br>
- [training curves](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/blob/main/training_curves.png)<br>
- [adapter model](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/)<br>
 
  RL GRPO Code: [train/grpo_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/grpo_unsloth.py)
 