---
base_model:
- TheDrummer/UnslopNemo-12B-v3
- inflatebot/MN-12B-Mag-Mell-R1
library_name: transformers
tags:
- mergekit
- merge
---
# final_merge_output

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) method, with [TheDrummer/UnslopNemo-12B-v3](https://huggingface.co/TheDrummer/UnslopNemo-12B-v3) as the base model.

### Models Merged

The following models were included in the merge:
* /teamspace/studios/this_studio/work_dir/intermediate_stages/adventure_muse_merge
* /teamspace/studios/this_studio/work_dir/intermediate_stages/unfiltered_creative_merge
* [inflatebot/MN-12B-Mag-Mell-R1](https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
base_model: TheDrummer/UnslopNemo-12B-v3
dtype: bfloat16
merge_method: task_arithmetic
models:
- model: /teamspace/studios/this_studio/work_dir/intermediate_stages/adventure_muse_merge
  parameters:
    weight: 0.4
- model: /teamspace/studios/this_studio/work_dir/intermediate_stages/unfiltered_creative_merge
  parameters:
    weight: 0.3
- model: inflatebot/MN-12B-Mag-Mell-R1
  parameters:
    weight: 0.3
parameters: null
```
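
To make the effect of the weights more concrete, here is a minimal sketch of the task arithmetic formula from the linked paper, `merged = base + sum_i w_i * (model_i - base)`, applied to toy parameter vectors. It is not mergekit's implementation: the function `task_arithmetic`, the variable names, and the three-element vectors are hypothetical stand-ins, and only the weights 0.4, 0.3, and 0.3 are taken from the configuration above.

```python
# Illustrative sketch only: toy lists of floats stand in for real model tensors.
# All names and values here are hypothetical except the 0.4 / 0.3 / 0.3 weights.

def task_arithmetic(base, models, weights):
    """Return base + sum_i w_i * (model_i - base), computed element-wise."""
    if len(models) != len(weights):
        raise ValueError("need exactly one weight per model")
    merged = list(base)
    for model, weight in zip(models, weights):
        if len(model) != len(base):
            raise ValueError("all parameter vectors must have the same length")
        for i, (m, b) in enumerate(zip(model, base)):
            # Add the weighted task vector (model minus base) for this model.
            merged[i] += weight * (m - b)
    return merged


# Toy stand-ins for the base model and the three merged models.
base = [1.0, 2.0, 3.0]
adventure_muse = [2.0, 2.0, 3.0]
unfiltered_creative = [1.0, 4.0, 3.0]
mag_mell = [1.0, 2.0, 5.0]

merged = task_arithmetic(
    base,
    [adventure_muse, unfiltered_creative, mag_mell],
    [0.4, 0.3, 0.3],
)
print(merged)
```

Under this plain formula, weights that sum to 1.0 make the base terms cancel, so the result equals the weighted average of the three models. Whether the actual merge behaves exactly this way depends on mergekit's handling of options such as normalization, which this sketch does not model.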