|
|
--- |
|
|
base_model: [] |
|
|
library_name: transformers |
|
|
tags: |
|
|
- mergekit |
|
|
- merge |
|
|
|
|
|
--- |
|
|
# prototype-0.4x244 |
|
|
|
|
|
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). |
|
|
|
|
|
## Merge Details |
|
|
### Merge Method |
|
|
|
|
|
This model was merged using the [Karcher Mean](https://en.wikipedia.org/wiki/Karcher_mean) merge method. |
|
|
|
|
|
### Models Merged |
|
|
|
|
|
The following models were included in the merge: |
|
|
* /workspace/cache/models--bruhzair--prototype-0.4x218/snapshots/6c45003b36deffe28d75ea9e1ee310efeb419693 |
|
|
* /workspace/cache/models--bruhzair--prototype-0.4x233/snapshots/af1e3c0fa6ad5b01170f023357e2406abc8642e9 |
|
|
* /workspace/cache/models--deepcogito--cogito-v1-preview-llama-70B/snapshots/1d624e2293b5b35f9cfd2349f8e02c7ebf32ca83 |
|
|
* /workspace/cache/models--bruhzair--prototype-0.4x234/snapshots/2a4d3c53dce8dfd8d3461dd84fe36e6c4df57a3b |
|
|
* /workspace/cache/models--TheDrummer--Fallen-Llama-3.3-70B-v1/snapshots/d46ef2629f1c3cd46789a55793c5ff0af60de3e8 |
|
|
* /workspace/cache/models--bruhzair--prototype-0.4x231/snapshots/3a938d7edb70c5afbd93de4e503327de332ce859 |
|
|
* /workspace/cache/models--nvidia--Llama-3.1-Nemotron-70B-Instruct-HF/snapshots/031d4042f36adc1a52cca51b331d25cbe3cf1022 |
|
|
|
|
|
### Configuration |
|
|
|
|
|
The following YAML configuration was used to produce this model: |
|
|
|
|
|
```yaml |
|
|
models: |
|
|
- model: /workspace/cache/models--bruhzair--prototype-0.4x218/snapshots/6c45003b36deffe28d75ea9e1ee310efeb419693 |
|
|
- model: /workspace/cache/models--bruhzair--prototype-0.4x231/snapshots/3a938d7edb70c5afbd93de4e503327de332ce859 |
|
|
- model: /workspace/cache/models--bruhzair--prototype-0.4x233/snapshots/af1e3c0fa6ad5b01170f023357e2406abc8642e9 |
|
|
- model: /workspace/cache/models--nvidia--Llama-3.1-Nemotron-70B-Instruct-HF/snapshots/031d4042f36adc1a52cca51b331d25cbe3cf1022 |
|
|
- model: /workspace/cache/models--bruhzair--prototype-0.4x234/snapshots/2a4d3c53dce8dfd8d3461dd84fe36e6c4df57a3b |
|
|
- model: /workspace/cache/models--deepcogito--cogito-v1-preview-llama-70B/snapshots/1d624e2293b5b35f9cfd2349f8e02c7ebf32ca83 |
|
|
- model: /workspace/cache/models--TheDrummer--Fallen-Llama-3.3-70B-v1/snapshots/d46ef2629f1c3cd46789a55793c5ff0af60de3e8 |
|
|
merge_method: karcher |
|
|
parameters: |
|
|
max_iter: 7000 |
|
|
tol: 1e-6 |
|
|
tokenizer: |
|
|
source: /workspace/cache/models--bruhzair--prototype-0.4x234/snapshots/2a4d3c53dce8dfd8d3461dd84fe36e6c4df57a3b |
|
|
chat_template: llama3 |
|
|
int8_mask: true |
|
|
dtype: bfloat16 |
|
|
``` |
|
|
|