jyc0325/Qwen2.5-1.5B-DPO-code-fix
Text Generation
Transformers
Safetensors
jyc0325/verifiable-coding-problems-python-pref
qwen2
Generated from Trainer
trl
dpo
conversational
text-generation-inference
arxiv:2305.18290
main
Commit History
End of training
30bd3c7
verified
jyc0325
committed on
Jul 3, 2025
Model save
4763010
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 6000
db1f332
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 5000
46ef6ef
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 4000
2324623
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 3000
6ca76d8
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 2000
c14c6ce
verified
jyc0325
committed on
Jul 3, 2025
Training in progress, step 1000
67c3655
verified
jyc0325
committed on
Jul 3, 2025
initial commit
65e0bca
verified
jyc0325
committed on
Jul 3, 2025