GingerBled/qwen-DPO
Safetensors
qwen3
dpo
License: apache-2.0
main
qwen-DPO/tokenizer.json
Commit History
Add final DPO fine-tuned checkpoint (merged)
136f52f
verified
bouchonnn committed on May 19, 2025