anujathore
/

DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Model card Files Files and versions

DeepSeek-R1-Distill-Qwen-1.5B-GRPO / tokenizer.json

Commit History

Training in progress, step 500

f775861
verified

anujathore commited on Jun 18, 2025

Training in progress, step 500

43de5b0
verified

anujathore commited on Apr 30, 2025

Training in progress, step 500

a33c2d3
verified

anujathore commited on Mar 19, 2025