KS150/testDPO
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
unsloth
qwen
alignment
conversational
text-generation-inference
License: apache-2.0
main
testDPO
Commit History
Upload README.md with huggingface_hub
f2126b3
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
35bd14b
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
2be2710
verified
KS150
committed on
16 days ago
Unsloth Model Card
af039b0
verified
KS150
committed on
16 days ago
Upload README.md with huggingface_hub
1571c49
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
6870510
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
9595aae
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
409321c
verified
KS150
committed on
16 days ago
(Trained with Unsloth)
8430b87
verified
KS150
committed on
16 days ago
Unsloth Model Card
8b4b518
verified
KS150
committed on
16 days ago
initial commit
232fc6b
verified
KS150
committed on
16 days ago