Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
naru0411
/
LLM-competition-DPO
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
qwen
alignment
silent-cot
structured-output
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
LLM-competition-DPO
Commit History
Upload README.md with huggingface_hub
50cc96e
verified
naru0411
commited on
24 days ago
(Trained with Unsloth)
65381ed
verified
naru0411
commited on
24 days ago
(Trained with Unsloth)
6d5461e
verified
naru0411
commited on
24 days ago
Unsloth Model Card
b501418
verified
naru0411
commited on
24 days ago
Upload README.md with huggingface_hub
1c6d977
verified
naru0411
commited on
24 days ago
(Trained with Unsloth)
7b155bf
verified
naru0411
commited on
24 days ago
(Trained with Unsloth)
1658f8a
verified
naru0411
commited on
24 days ago
Unsloth Model Card
6926a4a
verified
naru0411
commited on
24 days ago
Upload README.md with huggingface_hub
7a45e17
verified
naru0411
commited on
25 days ago
(Trained with Unsloth)
1448623
verified
naru0411
commited on
25 days ago
(Trained with Unsloth)
5b157b2
verified
naru0411
commited on
25 days ago
(Trained with Unsloth)
c095e35
verified
naru0411
commited on
25 days ago
(Trained with Unsloth)
f07a849
verified
naru0411
commited on
25 days ago
Unsloth Model Card
caa69d7
verified
naru0411
commited on
25 days ago
initial commit
c4c8ae4
verified
naru0411
commited on
25 days ago