Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Ginyyds
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Text Generation
Transformers
Safetensors
Ginyyds/subMath220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Commit History
End of training
456a290
verified
Ginyyds
commited on
Feb 26, 2025
Model save
1ea7e24
verified
Ginyyds
commited on
Feb 26, 2025
End of training
77bb605
verified
Ginyyds
commited on
Feb 26, 2025
Model save
9e47ed8
verified
Ginyyds
commited on
Feb 26, 2025
End of training
b7112f3
verified
Ginyyds
commited on
Feb 26, 2025
Model save
53d75cf
verified
Ginyyds
commited on
Feb 26, 2025
Training in progress, epoch 1
6133e14
verified
Ginyyds
commited on
Feb 26, 2025
initial commit
d15630e
verified
Ginyyds
commited on
Feb 24, 2025