grpo-6000 / tokenizer.json

Commit History

Convert VERL/GRPO actor checkpoint to HF format
c6f4c30
verified

McClain commited on