EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_4
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Commit History
End of training (8cde158, verified): EricLabile committed on Mar 25
Model save (b9c1930, verified): EricLabile committed on Mar 25
Training in progress, step 208 (f5f1703, verified): EricLabile committed on Mar 25
Training in progress, step 200 (f9900fa, verified): EricLabile committed on Mar 25
Training in progress, step 190 (1b45f49, verified): EricLabile committed on Mar 25
Training in progress, step 180 (56e5e73, verified): EricLabile committed on Mar 25
Training in progress, step 170 (48c2384, verified): EricLabile committed on Mar 25
Training in progress, step 160 (3506b85, verified): EricLabile committed on Mar 25
Training in progress, step 150 (09c1508, verified): EricLabile committed on Mar 25
Training in progress, step 140 (a6cce49, verified): EricLabile committed on Mar 25
Training in progress, step 130 (ce16089, verified): EricLabile committed on Mar 25
Training in progress, step 120 (cbd3eb0, verified): EricLabile committed on Mar 25
Training in progress, step 110 (a24f723, verified): EricLabile committed on Mar 25
Training in progress, step 100 (a8cedd8, verified): EricLabile committed on Mar 25
Training in progress, step 90 (089a0a4, verified): EricLabile committed on Mar 25
Training in progress, step 80 (33f213e, verified): EricLabile committed on Mar 25
Training in progress, step 70 (bb7ebe3, verified): EricLabile committed on Mar 25
Training in progress, step 60 (0b454bb, verified): EricLabile committed on Mar 25
Training in progress, step 50 (691bd7f, verified): EricLabile committed on Mar 25
Training in progress, step 40 (8402b01, verified): EricLabile committed on Mar 25
Training in progress, step 30 (446ebf7, verified): EricLabile committed on Mar 25
Training in progress, step 20 (1e44671, verified): EricLabile committed on Mar 25
Training in progress, step 10 (b2f1f8b, verified): EricLabile committed on Mar 25
initial commit (88aad4d, verified): EricLabile committed on Mar 25