defender-model / checkpoint-150 /trainer_state.json

Commit History

Defender GRPO checkpoint (Unsloth + TRL)
21ce071
verified

MuazTPM commited on