jash404 committed on
Commit d157524 · verified · 1 Parent(s): 16ac9df

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +8 -0
README.md ADDED
@@ -0,0 +1,8 @@
+ Top-level structure:
+
+ - `RL/`
+ Trained GRPO runs and checkpoints.
+ - `EVAL/`
+ Post-hoc GSM8K evaluation outputs.
+ - `SFT/`
+ SFT-side artifacts and runs.
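
The layout described in the added README can be checked against a local copy of the repository. The following is a minimal sketch, not part of the commit: only the three directory names come from the README, while the helper name `missing_top_level_dirs` and the constant `EXPECTED_DIRS` are hypothetical and introduced here for illustration.

```python
from pathlib import Path

# Directory names taken from the README; everything else is illustrative.
EXPECTED_DIRS = ("RL", "EVAL", "SFT")


def missing_top_level_dirs(root):
    """Return the expected top-level directories that are absent under root."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Calling `missing_top_level_dirs` on the path of a local checkout returns an empty list when `RL/`, `EVAL/`, and `SFT/` are all present, and otherwise lists the missing names in the order the README gives them.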