Paper: Editing Models with Task Arithmetic (arXiv:2212.04089)
This is a merge of pre-trained language models created using mergekit.
This model was merged with the task arithmetic merge method, using openaccess-ai-collective/DPOpenHermes-7B-v2 as the base model.
The following models were included in the merge:
- merged
- SanjiWatsuki/Lelantos-7B
The following YAML configuration was used to produce this model:
base_model: openaccess-ai-collective/DPOpenHermes-7B-v2
dtype: bfloat16
merge_method: task_arithmetic
slices:
- sources:
  - layer_range: [0, 32]
    model: openaccess-ai-collective/DPOpenHermes-7B-v2
  - layer_range: [0, 32]
    model: merged
    parameters:
      weight: 0.5
  - layer_range: [0, 32]
    model: SanjiWatsuki/Lelantos-7B
    parameters:
      weight: 0.5
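As a rough illustration of what this configuration asks for, task arithmetic adds weighted "task vectors" (each model's parameters minus the base model's) back onto the base. The sketch below uses tiny toy tensors represented as Python lists. The function name `task_arithmetic`, the parameter name `layer.0.weight`, and all numbers are hypothetical, and the sketch ignores any normalization or other options mergekit may apply, so it should not be read as mergekit's actual implementation.

```python
from typing import Dict, List

# A "model" here is just a mapping from parameter name to a flat list of floats.
Weights = Dict[str, List[float]]


def task_arithmetic(base: Weights, models: List[Weights], weights: List[float]) -> Weights:
    """Return base + sum_i weight_i * (model_i - base), parameter by parameter."""
    merged: Weights = {}
    for name, base_values in base.items():
        out = list(base_values)  # start from the base parameters
        for model, w in zip(models, weights):
            for j, value in enumerate(model[name]):
                # Add the weighted task vector (model minus base).
                out[j] += w * (value - base_values[j])
        merged[name] = out
    return merged


# Toy stand-ins (hypothetical values) for the three models in the YAML above.
base = {"layer.0.weight": [1.0, 2.0]}      # stand-in for DPOpenHermes-7B-v2
model_a = {"layer.0.weight": [2.0, 2.0]}   # stand-in for `merged`
model_b = {"layer.0.weight": [1.0, 4.0]}   # stand-in for SanjiWatsuki/Lelantos-7B

# Both non-base models use weight 0.5, as in the configuration.
result = task_arithmetic(base, [model_a, model_b], [0.5, 0.5])
print(result)  # {'layer.0.weight': [1.5, 3.0]}
```

With a weight of 0.5 on each task vector, every merged parameter moves halfway from the base toward each of the two fine-tuned models.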
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 70.82 |
| AI2 Reasoning Challenge (25-shot) | 67.06 |
| HellaSwag (10-shot) | 86.06 |
| MMLU (5-shot) | 64.11 |
| TruthfulQA (0-shot) | 61.33 |
| Winogrande (5-shot) | 79.56 |
| GSM8k (5-shot) | 66.79 |
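
The "Avg." row is consistent with an unweighted mean of the six benchmark scores. A minimal check, assuming that is how the average was computed (the `scores` dictionary simply copies the values from the table):

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-shot)": 67.06,
    "HellaSwag (10-shot)": 86.06,
    "MMLU (5-shot)": 64.11,
    "TruthfulQA (0-shot)": 61.33,
    "Winogrande (5-shot)": 79.56,
    "GSM8k (5-shot)": 66.79,
}

# Unweighted mean across the six benchmarks (assumed averaging scheme).
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 70.82
```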