CaptainNemo / model /README.md
Biscotto58's picture
Upload Full Merged Model
49c54b2 verified
---
base_model:
- TheDrummer/UnslopNemo-12B-v3
- inflatebot/MN-12B-Mag-Mell-R1
library_name: transformers
tags:
- mergekit
- merge
---
# final_merge_output
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) merge method using [TheDrummer/UnslopNemo-12B-v3](https://huggingface.co/TheDrummer/UnslopNemo-12B-v3) as a base.
### Models Merged
The following models were included in the merge:
* /teamspace/studios/this_studio/work_dir/intermediate_stages/adventure_muse_merge
* /teamspace/studios/this_studio/work_dir/intermediate_stages/unfiltered_creative_merge
* [inflatebot/MN-12B-Mag-Mell-R1](https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model: TheDrummer/UnslopNemo-12B-v3
dtype: bfloat16
merge_method: task_arithmetic
models:
- model: /teamspace/studios/this_studio/work_dir/intermediate_stages/adventure_muse_merge
parameters:
weight: 0.4
- model: /teamspace/studios/this_studio/work_dir/intermediate_stages/unfiltered_creative_merge
parameters:
weight: 0.3
- model: inflatebot/MN-12B-Mag-Mell-R1
parameters:
weight: 0.3
parameters: null
```