Sweaterdog/Smol-reason2

Sweaterdog committed on Mar 30, 2025

Commit 6c1c4df (verified) · 1 Parent(s): 0c1db57

Update README.md

Files changed (1)

README.md +19 -3

@@ -1,3 +1,19 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# 🧠Smol-Reason2🧠
+This is my second GRPO reasoning model. I was exploring fine-tuning on my own hardware and found that it works with 3B models.
+System prompt:
+```
+Respond in the following format:
+<think>
+...your reasoning here...
+</think>
+...your answer here...
+```
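
The system prompt added above asks the model to wrap its reasoning in `<think>` tags and put the answer after the closing tag. A minimal sketch of how a caller might split such a response is shown here. It is not part of this commit, and the names `SYSTEM_PROMPT` and `split_response` are hypothetical, chosen only for illustration.

```python
import re

# The system prompt text from the README, stored as a constant (hypothetical name).
SYSTEM_PROMPT = """Respond in the following format:
<think>
...your reasoning here...
</think>
...your answer here..."""

# Capture everything inside the first <think>...</think> pair, then the remainder.
_THINK_RE = re.compile(r"<think>(.*?)</think>(.*)", re.DOTALL)


def split_response(text):
    """Return (reasoning, answer) from a model response.

    If the response has no <think> block, reasoning is None and the
    whole stripped text is treated as the answer.
    """
    match = _THINK_RE.search(text)
    if match is None:
        return None, text.strip()
    return match.group(1).strip(), match.group(2).strip()


if __name__ == "__main__":
    reasoning, answer = split_response("<think>\n2 + 2 = 4\n</think>\nThe answer is 4.")
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Falling back to the full text when the tags are missing is one possible design choice; a stricter caller could instead treat a response without a `<think>` block as malformed.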