Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ogwata
/
exp7-dpo-baseline
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
unsloth
qwen
alignment
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
exp7-dpo-baseline
Commit History
Upload README.md with huggingface_hub
1beb7ab
verified
ogwata
commited on
Feb 15
(Trained with Unsloth)
382c612
verified
ogwata
commited on
Feb 15
(Trained with Unsloth)
1d5af2c
verified
ogwata
commited on
Feb 15
(Trained with Unsloth)
81f5a25
verified
ogwata
commited on
Feb 15
Unsloth Model Card
70bf5ec
verified
ogwata
commited on
Feb 15
Upload README.md with huggingface_hub
4e4b359
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
3a61384
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
1b58d63
verified
ogwata
commited on
Feb 13
Unsloth Model Card
8d4ff65
verified
ogwata
commited on
Feb 13
Upload README.md with huggingface_hub
ffe8cce
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
7e0b377
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
1885e01
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
c4888b8
verified
ogwata
commited on
Feb 13
(Trained with Unsloth)
c237f0e
verified
ogwata
commited on
Feb 13
Unsloth Model Card
7583552
verified
ogwata
commited on
Feb 13
initial commit
256ee29
verified
ogwata
commited on
Feb 13