Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
cameronphchen
/
Qwen2.5-1.5B-Open-R1-GRPO
like
0
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
Qwen2.5-1.5B-Open-R1-GRPO
/
training_args.bin
Commit History
Training in progress, step 200
dc83856
verified
cameronphchen
commited on
Mar 5, 2025
Training in progress, step 10
23c9f4a
verified
cameronphchen
commited on
Mar 4, 2025
Training in progress, step 50
bcc9b5d
verified
cameronphchen
commited on
Mar 4, 2025
Training in progress, step 50
75c48ea
verified
cameronphchen
commited on
Mar 3, 2025
Training in progress, step 50
4d8d9ba
verified
cameronphchen
commited on
Mar 3, 2025
Training in progress, step 350
50aad55
verified
cameronphchen
commited on
Feb 28, 2025
Training in progress, step 50
82d24e6
verified
cameronphchen
commited on
Feb 28, 2025
Training in progress, step 50
d21092d
verified
cameronphchen
commited on
Feb 28, 2025
Training in progress, step 50
7c81305
verified
cameronphchen
commited on
Feb 26, 2025
Training in progress, step 50
4c16afc
verified
cameronphchen
commited on
Feb 26, 2025
Training in progress, step 50
15aceb8
verified
cameronphchen
commited on
Feb 26, 2025
Training in progress, step 50
fa54e09
verified
cameronphchen
commited on
Feb 25, 2025
Training in progress, step 40
587c404
verified
cameronphchen
commited on
Feb 25, 2025
Training in progress, epoch 1
f36a4b1
verified
cameronphchen
commited on
Feb 24, 2025
Training in progress, epoch 1
bff3a3d
verified
cameronphchen
commited on
Feb 24, 2025