Model Stock: All we need is just a few fine-tuned models
Paper
•
2403.19522
•
Published
•
13
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Model Stock merge method using /workspace/cache/models--TheDrummer--Anubis-Pro-105B-v1/snapshots/2bbf619c35ffcb8ae3fe7f7b5a62948aab0f3022 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
base_model: /workspace/cache/models--TheDrummer--Anubis-Pro-105B-v1/snapshots/2bbf619c35ffcb8ae3fe7f7b5a62948aab0f3022
merge_method: model_stock
modules:
default:
slices:
- sources:
- layer_range: [0, 120]
model: /workspace/cache/models--bruhzair--ignore-merge-6/snapshots/87658005d40b593ba3e87e92e5fb3f28321266a1
- layer_range: [0, 120]
model: /workspace/cache/models--bruhzair--ignore-merge-8/snapshots/253c403660e94c867c277d8408e8cb518ab8bf1b
- layer_range: [0, 120]
model: /workspace/cache/models--bruhzair--ignore-merge-1/snapshots/0994ed36502b7c1942553cae165a5813acfc7f4b
- layer_range: [0, 120]
model: /workspace/cache/models--bruhzair--ignore-merge-2/snapshots/e60c43daa0fc7893e7a909d2bae952cffa3831a3
- layer_range: [0, 120]
model: /workspace/fallen2
- layer_range: [0, 120]
model: /workspace/cache/models--bruhzair--ignore-merge-15/snapshots/e60ee5422f827ab08ac0f591cf18fb7f43f629e8
- layer_range: [0, 120]
model: /workspace/cache/models--TheDrummer--Anubis-Pro-105B-v1/snapshots/2bbf619c35ffcb8ae3fe7f7b5a62948aab0f3022