1e-5_hf_test_repeat-step-100 / adapter_config.json

Commit History

verl GRPO trained model at step 100
29ddc5d
verified

thejaminator commited on