---
license: mit
tags:
- unsloth
- trl
- sft
datasets:
- HUGG222/R1-Like-Dataset
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
base_model:
- Qwen/Qwen2.5-3B-Instruct
---