defender-model / checkpoint-25 /optimizer.pt

Commit History

Defender GRPO checkpoint (Unsloth + TRL)
9b7fc67
verified

MuazTPM commited on