Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kamaboko2007
/
LLM_main_003_DPO
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
unsloth
qwen
alignment
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
LLM_main_003_DPO
Commit History
Upload README.md with huggingface_hub
5c9ffb3
verified
kamaboko2007
commited on
16 days ago
(Trained with Unsloth)
f4a2e37
verified
kamaboko2007
commited on
16 days ago
(Trained with Unsloth)
b7c575f
verified
kamaboko2007
commited on
16 days ago
(Trained with Unsloth)
ab17392
verified
kamaboko2007
commited on
16 days ago
(Trained with Unsloth)
846f9c6
verified
kamaboko2007
commited on
16 days ago
Unsloth Model Card
d1786ca
verified
kamaboko2007
commited on
16 days ago
initial commit
ecd10cf
verified
kamaboko2007
commited on
16 days ago