Nada2022
/

dpo-qwen-cot-merged

Text Generation

Model card Files Files and versions

dpo-qwen-cot-merged

This repository contains DPO fine-tuning artifacts.

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for Nada2022/dpo-qwen-cot-merged

Base model

Qwen/Qwen3-4B-Instruct-2507

Finetuned

(1652)

this model

Dataset used to train Nada2022/dpo-qwen-cot-merged