FinLM-Reasoning / trainer_state.json

Commit History

GRPO 1000 steps
3f7b5a0
verified

marco-molinari commited on