decomposeRL-7b / README.md

Commit History

Update
70ca118
verified

dipta007 commited on

Update README
df8336a
verified

dipta007 commited on

Update README
257b03c
verified

dipta007 commited on

Add in-domain baseline comparison table
910375e
verified

dipta007 commited on

Trim example to 2 iterations with per-iter think blocks
663efe3
verified

dipta007 commited on

Update model card: 7 rewards, OOD coverbench, pretty-print helper, fix max-len
486896f
verified

dipta007 commited on

Add detailed model card
188754b
verified

dipta007 commited on

Unsloth Model Card
81a8ce8
verified

dipta007 commited on