grpo-countdown-model / tokenizer_config.json

Commit History

Upload GRPO trained model for Countdown math problems
0ad7c12
verified

jasong03 commited on