Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
toshino
/
base-dpo
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
unsloth
qwen
alignment
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
base-dpo
Commit History
Upload README.md with huggingface_hub
6b197f0
verified
toshino
commited on
10 days ago
(Trained with Unsloth)
d4f68ab
verified
toshino
commited on
10 days ago
(Trained with Unsloth)
f006e73
verified
toshino
commited on
10 days ago
(Trained with Unsloth)
15573b9
verified
toshino
commited on
10 days ago
(Trained with Unsloth)
5166976
verified
toshino
commited on
10 days ago
Unsloth Model Card
2d887a6
verified
toshino
commited on
10 days ago
initial commit
ef6874a
verified
toshino
commited on
10 days ago