Resolving Interference When Merging Models
Paper • 2306.01708 • Published • 18
This is a merge of pre-trained language models created using mergekit.
Quantized using llama.cpp.
This model was merged using the TIES merge method using bardsai/jaskier-7b-dpo-v5.6 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: bardsai/jaskier-7b-dpo-v5.6
- model: nbeerbower/bruphin-zeta
parameters:
density: 0.5
weight: 0.5
- model: Gille/StrangeMerges_16-7B-slerp
parameters:
density: 0.5
weight: 0.3
merge_method: ties
base_model: bardsai/jaskier-7b-dpo-v5.6
parameters:
normalize: true
dtype: bfloat16
4-bit