mlxha/DeepSeek-R1-Distill-Llama-8B-GRPO-code-2

Commit History

Training in progress, step 60

a7d5e86
verified

mlxha committed on Apr 17, 2025

Training in progress, step 40

7f056df
verified

mlxha committed on Apr 17, 2025

Training in progress, step 20

e5ec648
verified

mlxha committed on Apr 17, 2025

initial commit

be72ca2
verified

mlxha committed on Apr 15, 2025