fix: optimize GRPO trainer, ignore checkpoints and binary libs 128809c Rithwik Ravi commited on 17 days ago