This model is a merged version of the following:
HLSN/dpo-qwen-cot-merged
HLSN/LLM_AR_2026
It was created by merging the LoRA adapter into the base model.
Chat template
Files info
Base model