Merged Model: HLSN/merged-dpo-qwen-cot-2026

This model is a merged version of the following:

  • Base Model: HLSN/dpo-qwen-cot-merged
  • Adapter: HLSN/LLM_AR_2026

It was created by merging the LoRA adapter into the base model.

Downloads last month
-
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for HLSN/merged-dpo-qwen-cot-2026

Adapter
(1)
this model