Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
leonMW
/
DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-4
like
0
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
grpo
trl
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-4
Commit History
Model save
5d892e0
verified
leonMW
commited on
Nov 14, 2025
Training in progress, epoch 1
96ee00d
verified
leonMW
commited on
Nov 14, 2025
Model save
a3a2845
verified
leonMW
commited on
Nov 13, 2025
Training in progress, epoch 5
98a7a00
verified
leonMW
commited on
Nov 13, 2025
Training in progress, epoch 4
53e7ca5
verified
leonMW
commited on
Nov 13, 2025
Training in progress, epoch 3
388fec4
verified
leonMW
commited on
Nov 13, 2025
Training in progress, epoch 2
905eadf
verified
leonMW
commited on
Nov 13, 2025
Training in progress, epoch 1
bd46909
verified
leonMW
commited on
Nov 13, 2025
initial commit
6edce1c
verified
leonMW
commited on
Nov 13, 2025