Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jyc0325
/
Qwen2.5-1.5B-ORPO-code
like
0
Text Generation
Transformers
Safetensors
jyc0325/verifiable-coding-problems-python-pref
qwen2
Generated from Trainer
trl
orpo
conversational
text-generation-inference
arxiv:
2403.07691
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-1.5B-ORPO-code
Commit History
End of training
e904a1c
verified
jyc0325
commited on
Jul 2, 2025
Model save
b3cf265
verified
jyc0325
commited on
Jul 2, 2025
Training in progress, step 774
1b85c37
verified
jyc0325
commited on
Jul 2, 2025
Training in progress, step 500
4043293
verified
jyc0325
commited on
Jul 2, 2025
initial commit
e9f8520
verified
jyc0325
commited on
Jul 2, 2025