File size: 5,998 Bytes
ba2c011 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 |
---
base_model:
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talon-merged-run1
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp2lora
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp12-helpsteer
- unsloth/Llama-3.2-3B-Instruct
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp9-scinemotron
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp6-mathv1nemotron
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp4
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp7-mathv1.1nemotron
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp13-debug
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp5
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp10-safe
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp8-code.1nemotron
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp1lora
- unsloth/Llama-3.2-3B-Instruct
- marcuscedricridia/talonp11-fleschprose
library_name: transformers
tags:
- mergekit
- merge
---
# merge
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) as a base.
### Models Merged
The following models were included in the merge:
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talon-merged-run1](https://huggingface.co/marcuscedricridia/talon-merged-run1)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp2lora](https://huggingface.co/marcuscedricridia/talonp2lora)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp12-helpsteer](https://huggingface.co/marcuscedricridia/talonp12-helpsteer)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp9-scinemotron](https://huggingface.co/marcuscedricridia/talonp9-scinemotron)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp6-mathv1nemotron](https://huggingface.co/marcuscedricridia/talonp6-mathv1nemotron)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp4](https://huggingface.co/marcuscedricridia/talonp4)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp7-mathv1.1nemotron](https://huggingface.co/marcuscedricridia/talonp7-mathv1.1nemotron)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp13-debug](https://huggingface.co/marcuscedricridia/talonp13-debug)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp5](https://huggingface.co/marcuscedricridia/talonp5)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp10-safe](https://huggingface.co/marcuscedricridia/talonp10-safe)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp8-code.1nemotron](https://huggingface.co/marcuscedricridia/talonp8-code.1nemotron)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp1lora](https://huggingface.co/marcuscedricridia/talonp1lora)
* [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) + [marcuscedricridia/talonp11-fleschprose](https://huggingface.co/marcuscedricridia/talonp11-fleschprose)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
merge_method: ties
base_model: unsloth/Llama-3.2-3B-Instruct
parameters:
normalize: true
int8_mask: true
dtype: bfloat16
models:
- model: unsloth/Llama-3.2-3B-Instruct
# No parameters needed for the base model itself in this list format
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talon-merged-run1
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp13-debug
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp12-helpsteer
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp11-fleschprose
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp10-safe
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp9-scinemotron
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp8-code.1nemotron
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp7-mathv1.1nemotron
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp6-mathv1nemotron
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp5
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp4
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp1lora
parameters:
density: 0.5
weight: 0.07692 # 1/13
- model: unsloth/Llama-3.2-3B-Instruct+marcuscedricridia/talonp2lora
parameters:
density: 0.5
weight: 0.07692 # 1/13
```
|