Commit History

Update
70ca118
verified

dipta007 commited on

Update
54b21c8
verified

dipta007 commited on

Update README
df8336a
verified

dipta007 commited on

Update README
257b03c
verified

dipta007 commited on

Add in-domain baseline comparison table
910375e
verified

dipta007 commited on

Trim example to 2 iterations with per-iter think blocks
663efe3
verified

dipta007 commited on

Update model card: 7 rewards, OOD coverbench, pretty-print helper, fix max-len
486896f
verified

dipta007 commited on

Add detailed model card
188754b
verified

dipta007 commited on

(Trained with Unsloth)
f5bcc44
verified

dipta007 commited on

(Trained with Unsloth)
710d849
verified

dipta007 commited on

(Trained with Unsloth)
d78f621
verified

dipta007 commited on

(Trained with Unsloth)
1260b12
verified

dipta007 commited on

(Trained with Unsloth)
d92bbbd
verified

dipta007 commited on

(Trained with Unsloth)
336eb36
verified

dipta007 commited on

Unsloth Model Card
81a8ce8
verified

dipta007 commited on

initial commit
740bc31
verified

dipta007 commited on