--- base_model: - migtissera/Tess-3-Mistral-Nemo-12B - ToastyPigeon/a-strange-nemo-model - ToastyPigeon/another-strange-nemo-model library_name: transformers tags: - mergekit - merge --- # merged This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) merge method using [migtissera/Tess-3-Mistral-Nemo-12B](https://huggingface.co/migtissera/Tess-3-Mistral-Nemo-12B) as a base. ### Models Merged The following models were included in the merge: * [ToastyPigeon/a-strange-nemo-model](https://huggingface.co/ToastyPigeon/a-strange-nemo-model) * [ToastyPigeon/another-strange-nemo-model](https://huggingface.co/ToastyPigeon/another-strange-nemo-model) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: ToastyPigeon/a-strange-nemo-model parameters: weight: 0.5 - model: ToastyPigeon/another-strange-nemo-model parameters: weight: 0.5 base_model: migtissera/Tess-3-Mistral-Nemo-12B merge_method: task_arithmetic dtype: bfloat16 ```