lisali126/DDR1_Q1.5B-GRPOFixReward
Safetensors
qwen2
main
Commit History
Training in progress, step 120
ed5fe7f
verified
lisali126
committed on
Dec 9, 2025
Training in progress, step 100
0486b8c
verified
lisali126
committed on
Dec 9, 2025
Training in progress, step 80
9f904de
verified
lisali126
committed on
Dec 9, 2025
Training in progress, step 60
9d3c109
verified
lisali126
committed on
Dec 9, 2025
Training in progress, step 40
de0d355
verified
lisali126
committed on
Dec 9, 2025
Training in progress, step 20
168c976
verified
lisali126
committed on
Dec 9, 2025
initial commit
b33c108
verified
lisali126
committed on
Dec 7, 2025