yashash045
/

devops-pipeline-gym-trained

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

devops-pipeline-gym-trained / final /ref

69.8 MB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

yashash045's picture

GRPO retry: 300 steps from SFT, num_gen=16, max_comp_len=512

42f8547 verified 3 months ago

adapter_config.json

1.19 kB
GRPO retry: 300 steps from SFT, num_gen=16, max_comp_len=512 3 months ago
adapter_model.safetensors

69.8 MB
xet

GRPO retry: 300 steps from SFT, num_gen=16, max_comp_len=512 3 months ago