Editing Models with Task Arithmetic
Paper: arXiv:2212.04089
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Task Arithmetic merge method, with /content/Qwen3-1.7B as the base.
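As a rough illustration of the method: task arithmetic computes a task vector for each fine-tuned model (its weights minus the base weights) and adds a weighted sum of those vectors back onto the base. The sketch below uses plain Python lists as stand-ins for weight tensors. The function name `task_arithmetic` and the toy values are hypothetical, and treating `normalize` as "divide by the sum of the weights" is an assumption about the option's behavior, not a copy of mergekit's implementation.

```python
def task_arithmetic(base, finetuned, weights, normalize=True):
    """Merge fine-tuned parameter vectors into a base vector.

    base      -- list of floats (stand-in for one weight tensor)
    finetuned -- list of lists, one per fine-tuned model
    weights   -- one scalar weight per fine-tuned model
    """
    if len(finetuned) != len(weights):
        raise ValueError("need exactly one weight per fine-tuned model")

    # Assumption: normalization rescales the weights so they sum to 1.
    total = sum(weights)
    scale = 1.0 / total if normalize and total != 0 else 1.0

    merged = []
    for i, b in enumerate(base):
        # Task vector of each model at position i is (ft[i] - b).
        delta = sum(w * (ft[i] - b) for ft, w in zip(finetuned, weights))
        merged.append(b + scale * delta)
    return merged


# Toy values, not real model weights.
base = [1.0, 2.0, 3.0]
math_model = [2.0, 2.0, 3.0]  # differs from base at position 0
code_model = [1.0, 2.0, 5.0]  # differs from base at position 2

merged = task_arithmetic(base, [math_model, code_model], [0.65, 0.35])
print(merged)
```

Under that assumption, weights of 0.65 and 0.35 already sum to 1, so normalization would leave them effectively unchanged.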
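The following models were included in the merge:
- /content/drive/MyDrive/Khoá_luận_tốt_nghiệp/Model/qwen3-1.7b-gsm8k-merged
- /content/drive/MyDrive/Khoá_luận_tốt_nghiệp/Model/qwen3-1.7b-leetcode-merged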
The following YAML configuration was used to produce this model:
# merge_task_arithmetic.yml
models:
  - model: /content/drive/MyDrive/Khoá_luận_tốt_nghiệp/Model/qwen3-1.7b-gsm8k-merged
    parameters:
      weight: 0.65  # Keep the main model as is
  - model: /content/drive/MyDrive/Khoá_luận_tốt_nghiệp/Model/qwen3-1.7b-leetcode-merged
    parameters:
      weight: 0.35  # Scale the task vector from Pyxidis
merge_method: task_arithmetic
base_model: /content/Qwen3-1.7B  # Pre-trained base (before fine-tuning)
dtype: bfloat16
tokenizer_source: union
parameters:
  normalize: true
  # Apply different weights per layer
  int8_mask: false
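To reproduce a merge from a configuration like this, mergekit provides the `mergekit-yaml` command, which takes a config path followed by an output directory. Below is a minimal sketch, assuming mergekit is installed and the config is saved as `merge_task_arithmetic.yml`. The helper names `build_merge_command` and `run_merge` and the output directory `./merged-model` are hypothetical. The model paths in the config point to local directories, so they would need to exist on the machine running the merge.

```python
import shlex
import shutil
import subprocess


def build_merge_command(config_path, output_dir):
    """Return the argv list for a mergekit-yaml invocation."""
    return ["mergekit-yaml", config_path, output_dir]


def run_merge(config_path, output_dir):
    """Run the merge if the mergekit CLI is on PATH; return True if it ran."""
    if shutil.which("mergekit-yaml") is None:
        print("mergekit-yaml not found; install mergekit first")
        return False
    # check=True raises CalledProcessError if the merge exits with an error.
    subprocess.run(build_merge_command(config_path, output_dir), check=True)
    return True


# Hypothetical output directory; the config file name is taken from the
# comment on the first line of the YAML.
command = build_merge_command("merge_task_arithmetic.yml", "./merged-model")
print(shlex.join(command))
```

Calling `run_merge("merge_task_arithmetic.yml", "./merged-model")` would then start the actual merge. The example only prints the command so that it can be inspected first.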