Upload folder using huggingface_hub
README.md
CHANGED
@@ -105,11 +105,9 @@ And did and SFT to teach/align our SLM to the expected Marimo/Manim code style.<
 
 ## Links
 
-SFT Code:
+SFT Code: [train/sft_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/sft_unsloth.py) and [adapter model](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/)<br>
+<br>
 
-[train/sft_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/sft_unsloth.py)<br>
-[training curves](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/blob/main/training_curves.png)<br>
-[adapter model](https://huggingface.co/kgdrathan/ministral-3-3b-4bit-marimo-manim/)<br>
 
 RL GRPO Code: [train/grpo_unsloth.py](https://gitlab.com/kgdrathan/openenv-explainer/-/blob/main/train/grpo_unsloth.py)
 