Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Thrillcrazyer
/
Qwen-7B_Precision_GSPO
like
0
Text Generation
Transformers
Safetensors
DeepMath-103k
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen-7B_Precision_GSPO
Commit History
End of training
42119dd
verified
Thrillcrazyer
commited on
Jan 17
Training in progress, step 240
0dbaf3b
verified
Thrillcrazyer
commited on
Jan 17
Training in progress, step 200
95b6934
verified
Thrillcrazyer
commited on
Jan 17
Training in progress, step 150
242b434
verified
Thrillcrazyer
commited on
Jan 17
Training in progress, step 100
9d8ab55
verified
Thrillcrazyer
commited on
Jan 17
Training in progress, step 50
7007328
verified
Thrillcrazyer
commited on
Jan 17
initial commit
aa9d31b
verified
Thrillcrazyer
commited on
Jan 17