grpo-countdown-model / tokenizer.json

Commit History

Upload GRPO trained model for Countdown math problems
0ad7c12
verified

jasong03 commited on