Rexhaif
/

anon-4b-rl-thinking

Model card Files Files and versions

anon-4b-rl-thinking

Commit History

Restore published checkpoint to Mar 30 step 3575

75db045
verified

Rexhaif commited on Apr 26

Upload selected GRPO thinking checkpoint step 3575

b7db216
verified

Rexhaif commited on Apr 24

Upload selected GRPO thinking checkpoint step 3950

4c46808
verified

Rexhaif commited on Apr 22

Replace HF-pushed RL-Thinking with bestmttask step 3725

3ac0117
verified

Rexhaif commited on Apr 18

Upload selected GRPO thinking checkpoint step 3575

90c5d26
verified

Rexhaif commited on Mar 30

initial commit

31618b3
verified

Rexhaif commited on Mar 30