Gridmind/scripts/diagnose_reward.py

Commit History

feat: add script to migrate max_new_tokens from GRPOConfig to GRPOTrainer in notebook
08731ee

Prajwal782007 committed on