Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Nada2022
/
dpo-qwen-cot-merged
like
0
Text Generation
Transformers
Safetensors
u-10bei/dpo-dataset-qwen-cot
English
dpo
unsloth
qwen
alignment
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
dpo-qwen-cot-merged
dpo-qwen-cot-merged
This repository contains DPO fine-tuning artifacts.
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Nada2022/dpo-qwen-cot-merged
Base model
Qwen/Qwen3-4B-Instruct-2507
Finetuned
(
1652
)
this model
Dataset used to train
Nada2022/dpo-qwen-cot-merged
u-10bei/dpo-dataset-qwen-cot
Viewer
•
Updated
Jan 23
•
4.04k
•
10
•
2