1e-5_hf_test_repeat-step-40 / adapter_config.json

Commit History

verl GRPO trained model at step 40
2afc00a
verified

thejaminator committed on