Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blancy
/
Qwen-2.5-7B-Simple-RL
like
1
Text Generation
Transformers
Safetensors
Blancy/secondfiltered-math220k-difficulty_stratified_10k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen-2.5-7B-Simple-RL
/
tokenizer.json
Commit History
Training in progress, step 31
4dd0cd4
verified
Blancy
commited on
Mar 23, 2025
Model save
4777246
verified
Blancy
commited on
Mar 23, 2025
Training in progress, step 15
5edec54
verified
Blancy
commited on
Mar 16, 2025
Model save
77cd494
verified
Blancy
commited on
Mar 16, 2025
Training in progress, step 7
ca4401d
verified
Blancy
commited on
Mar 15, 2025
Model save
d9b432b
verified
Blancy
commited on
Mar 15, 2025
Training in progress, step 7
c5400a7
verified
Blancy
commited on
Mar 15, 2025
Model save
171b560
verified
Blancy
commited on
Mar 15, 2025
Training in progress, step 7
2d6ba23
verified
Blancy
commited on
Mar 13, 2025
Model save
ca57ebc
verified
Blancy
commited on
Mar 13, 2025
Training in progress, step 10
4e03fa7
verified
Blancy
commited on
Mar 6, 2025
Model save
6e7caea
verified
Blancy
commited on
Mar 4, 2025
Model save
feaafd4
verified
Blancy
commited on
Feb 28, 2025