defender-model / checkpoint-150 /training_args.bin

Commit History

Defender GRPO checkpoint (Unsloth + TRL)
21ce071
verified

MuazTPM commited on