Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
kmd2525
/
dpo-v1
like
0
Text Generation
Transformers
Safetensors
English
qwen3
dpo
unsloth
qwen
alignment
v1-improved
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
dpo-v1
Commit History
Upload README.md with huggingface_hub
c85e375
verified
kmd2525
commited on
Feb 15
(Trained with Unsloth)
2604794
verified
kmd2525
commited on
Feb 15
(Trained with Unsloth)
c4d9762
verified
kmd2525
commited on
Feb 15
(Trained with Unsloth)
01c3ba4
verified
kmd2525
commited on
Feb 15
(Trained with Unsloth)
c81a4dc
verified
kmd2525
commited on
Feb 15
Unsloth Model Card
8693a5e
verified
kmd2525
commited on
Feb 15
initial commit
4c80e8b
verified
kmd2525
commited on
Feb 15