Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
anujathore
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
/
tokenizer.json
Commit History
Training in progress, step 500
f775861
verified
anujathore
commited on
Jun 18, 2025
Training in progress, step 500
43de5b0
verified
anujathore
commited on
Apr 30, 2025
Training in progress, step 500
a33c2d3
verified
anujathore
commited on
Mar 19, 2025