FuseChat: Knowledge Fusion of Chat Models
Paper
• 2408.07990 • Published
• 14
This is a merge of pre-trained language models created using mergekit.
This model was merged using the SCE merge method using TareksLab/M-BASE-SCE as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: TareksLab/M-MERGE4
- model: TareksLab/M-MERGE3
- model: TareksLab/M-MERGE2
- model: TareksLab/M-MERGE1
merge_method: sce
base_model: TareksLab/M-BASE-SCE
parameters:
select_topk: 0.16
int8_mask: true
chat_template: llama3
tokenizer:
source: TareksLab/M-TOKENIZER-SCE
dtype: bfloat16