thejaminator/12sep_grp16_1e5_lr-step-60

Text Generation

698 MB

1 contributor

History: 2 commits

verl GRPO trained model at step 60

f564542 verified 6 months ago