defender-model / checkpoint-4 / trainer_state.json

Commit History

Defender GRPO checkpoint (Unsloth + TRL)
9b7fc67
verified

MuazTPM committed on