Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kamaboko2007
/
LLM_main_002_BPO
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
qwen3
dpo
unsloth
qwen
alignment
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
LLM_main_002_BPO
Commit History
Upload README.md with huggingface_hub
e99322a
verified
kamaboko2007
commited on
14 days ago
(Trained with Unsloth)
3a469d4
verified
kamaboko2007
commited on
14 days ago
(Trained with Unsloth)
f043f6b
verified
kamaboko2007
commited on
14 days ago
(Trained with Unsloth)
89203be
verified
kamaboko2007
commited on
14 days ago
(Trained with Unsloth)
30f9994
verified
kamaboko2007
commited on
14 days ago
Unsloth Model Card
2301d40
verified
kamaboko2007
commited on
14 days ago
initial commit
ecfd95a
verified
kamaboko2007
commited on
14 days ago