KS150/testDPO
Tags: Text Generation, Transformers, Safetensors, u-10bei/dpo-dataset-qwen-cot, English, qwen3, dpo, unsloth, qwen, alignment, conversational, text-generation-inference
License: apache-2.0
Branch: main
testDPO, 8.06 GB, 1 contributor
History: 11 commits
Latest commit: KS150, "Upload README.md with huggingface_hub" (f2126b3, verified), 16 days ago
File                              Size       Flags      Last commit                            Updated
.gitattributes                    1.57 kB    Safe       (Trained with Unsloth)                 16 days ago
README.md                         1.89 kB    -          Upload README.md with huggingface_hub  16 days ago
added_tokens.json                 707 Bytes  Safe       (Trained with Unsloth)                 16 days ago
chat_template.jinja               2.51 kB    Safe       (Trained with Unsloth)                 16 days ago
config.json                       1.81 kB    Safe       (Trained with Unsloth)                 16 days ago
merges.txt                        1.67 MB    Safe       (Trained with Unsloth)                 16 days ago
model-00001-of-00002.safetensors  4.97 GB    xet        (Trained with Unsloth)                 16 days ago
model-00002-of-00002.safetensors  3.08 GB    xet        (Trained with Unsloth)                 16 days ago
model.safetensors.index.json      32.9 kB    Safe       (Trained with Unsloth)                 16 days ago
special_tokens_map.json           614 Bytes  Safe       (Trained with Unsloth)                 16 days ago
tokenizer.json                    11.4 MB    Safe, xet  (Trained with Unsloth)                 16 days ago
tokenizer_config.json             8.08 kB    Safe       (Trained with Unsloth)                 16 days ago
vocab.json                        2.78 MB    Safe       (Trained with Unsloth)                 16 days ago