DeepSeek-R1-Distill-Qwen-1.5B-GRPO / generation_config.json

Commit History

Model save
fd39aa6
verified

beichenhang committed on

Training in progress, epoch 1
19a0704
verified

beichenhang committed on