FinLM-Reasoning / trainer_state.json
marco-molinari's picture
GRPO 1000 steps
3f7b5a0 verified
raw
history contribute delete
528 kB
File too large to display, you can check the raw version instead.