Macromrit commited on
Commit
d95fa21
·
verified ·
1 Parent(s): b8a2908

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -8,6 +8,7 @@ base_model:
8
  - HuggingFaceTB/SmolLM2-135M-Instruct
9
  ---
10
 
 
11
 
12
  # SmolLM2-135M Fine-Tuned with GRPO on GSM8K (First 1500 Samples)
13
 
 
8
  - HuggingFaceTB/SmolLM2-135M-Instruct
9
  ---
10
 
11
+ ![GRPO Training Overview](GRPO.png)
12
 
13
  # SmolLM2-135M Fine-Tuned with GRPO on GSM8K (First 1500 Samples)
14