hyan
/

grpo_reasoning_model

Generated from Trainer

Model card Files Files and versions

grpo_reasoning_model

Commit History

Training in progress, step 500

3e5eeee
verified

hyan commited on 29 days ago

Training in progress, step 400

c420450
verified

hyan commited on 29 days ago

Training in progress, step 300

81f55c4
verified

hyan commited on 29 days ago

Training in progress, step 200

4927934
verified

hyan commited on 29 days ago

Training in progress, step 100

a21b893
verified

hyan commited on 29 days ago

initial commit

6e83291
verified

hyan commited on 29 days ago