leonMW/DeepSeek-R1-Distill-Qwen-14B-GSPO-Easy
Tags: Text Generation, Transformers, Safetensors, qwen2, Generated from Trainer, trl, grpo, conversational, text-generation-inference, arxiv:2402.03300
Commit History (branch: main)
4b9e766 | Model save | leonMW (verified) | committed on Nov 29, 2025
2f55bb6 | Training in progress, epoch 5 | leonMW (verified) | committed on Nov 29, 2025
e2afd24 | Training in progress, epoch 4 | leonMW (verified) | committed on Nov 28, 2025
e31e8d1 | Training in progress, epoch 3 | leonMW (verified) | committed on Nov 27, 2025
40536ba | Training in progress, epoch 2 | leonMW (verified) | committed on Nov 26, 2025
38504ee | Training in progress, epoch 1 | leonMW (verified) | committed on Nov 26, 2025
2f6fb35 | Training in progress, epoch 2 | leonMW (verified) | committed on Oct 30, 2025
a881e5d | Training in progress, epoch 1 | leonMW (verified) | committed on Oct 30, 2025
4b95c1e | Training in progress, epoch 1 | leonMW (verified) | committed on Oct 23, 2025
9efc605 | Training in progress, step 200 | leonMW (verified) | committed on Oct 21, 2025
f8c7807 | initial commit | leonMW (verified) | committed on Oct 9, 2025