Paper: Model Stock: All we need is just a few fine-tuned models (arXiv:2403.19522)
This is a merge of pre-trained language models created using mergekit.
This model is a merge of all of my SOVL models, made in the hope of creating the most unhinged and wild model possible.
This model was merged using the Model Stock merge method using saishf/Ortho-SOVL-8B-L3 as a base.
The following models were included in the merge:
- saishf/Merge-Mayhem-L3-V2
- saishf/Merge-Mayhem-L3-V2.1
- saishf/SOVLish-Maid-L3-8B
The following YAML configuration was used to produce this model:
models:
- model: saishf/Ortho-SOVL-8B-L3
- model: saishf/Merge-Mayhem-L3-V2
- model: saishf/Merge-Mayhem-L3-V2.1
- model: saishf/SOVLish-Maid-L3-8B
merge_method: model_stock
base_model: saishf/Ortho-SOVL-8B-L3
dtype: bfloat16
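
For readers unfamiliar with `merge_method: model_stock`, here is a minimal sketch of the interpolation rule described in the Model Stock paper, applied to a single layer. It is not mergekit's implementation. The function name `model_stock_layer`, the flat-list weight representation, and the choice to average pairwise cosines are assumptions made for illustration only. The idea: estimate the angle θ between the fine-tuned models' offsets from the base, compute the ratio t = k·cos θ / (1 + (k − 1)·cos θ) for k fine-tuned models, then blend the averaged fine-tuned weights with the base weights using t.

```python
import math
from itertools import combinations


def model_stock_layer(base, finetuned):
    """Merge one layer (flat lists of floats) with a Model Stock style rule.

    Illustrative sketch only: `base` is the base model's weights for the
    layer, `finetuned` is a list of fine-tuned weights for the same layer.
    Returns the merged weights and the interpolation ratio t.
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")

    # Offsets of each fine-tuned model from the base ("task vectors").
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = math.sqrt(sum(a * a for a in u))
        norm_v = math.sqrt(sum(b * b for b in v))
        # Treat a zero-length offset as orthogonal to everything.
        return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0

    # Assumption: estimate cos(theta) as the mean over all pairs of offsets.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio between the fine-tuned average and the base.
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    average = [sum(column) / k for column in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(average, base)]
    return merged, t


if __name__ == "__main__":
    # Toy 2-dimensional "layer" with made-up numbers.
    merged, t = model_stock_layer([0.0, 0.0], [[1.0, 0.0], [1.0, 1.0]])
    print(t, merged)
```

When the fine-tuned offsets point in the same direction, t approaches 1 and the result is the plain average. When they are orthogonal, t falls to 0 and the result stays at the base weights.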
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 67.43 |
| AI2 Reasoning Challenge (25-shot) | 62.03 |
| HellaSwag (10-shot) | 79.68 |
| MMLU (5-shot) | 67.64 |
| TruthfulQA (0-shot) | 51.84 |
| Winogrande (5-shot) | 76.16 |
| GSM8k (5-shot) | 67.25 |
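
As a quick sanity check, the "Avg." row can be reproduced as the unweighted mean of the six benchmark scores in the table. The short dictionary keys are just labels chosen for this snippet.

```python
# Scores copied from the table above; keys are shorthand labels.
scores = {
    "ARC": 62.03,
    "HellaSwag": 79.68,
    "MMLU": 67.64,
    "TruthfulQA": 51.84,
    "Winogrande": 76.16,
    "GSM8k": 67.25,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 67.43
```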