Paper: DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling (arXiv 2406.11617)
⚠️ Warning: This model can produce narratives and roleplay (RP) that contain violent and graphic erotic content. Adjust your system prompt accordingly, and use the Mistral Tekken chat template.
architecture: MistralForCausalLM
models:
  - model: B:\24B\!models--anthracite-core--Mistral-Small-3.2-24B-Instruct-2506-Text-Only
  - model: B:\24B\!models--TheDrummer--Cydonia-24B-v4.3
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--ReadyArt--4.2.0-Broken-Tutu-24b
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--zerofata--MS3.2-PaintedFantasy-v2-24B
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--TheDrummer--Magidonia-24B-v4.3
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--TheDrummer--Precog-24B-v1
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--zerofata--MS3.2-PaintedFantasy-v3-24B
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!BeaverAI_Fallen-Mistral-Small-3.1-24B-v1e_textonly
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--ReadyArt--Broken-Tutu-24B-Transgression-v2.0
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--trashpanda-org--MS3.2-24B-Mullein-v2
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--LatitudeGames--Hearthfire-24B
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--TheDrummer--Cydonia-24B-v4.2.0
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--TheDrummer--Magidonia-24B-v4.2.0
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--ConicCat--Mistral-Small-3.2-AntiRep-24B
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--Undi95--MistralThinker-v1.1
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--CrucibleLab--M3.2-24B-Loki-V2
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
  - model: B:\24B\!models--Darkhn--M3.2-24B-Animus-V7.1
    parameters:
      density: 0.666
      weight: 0.25
      epsilon: 0.333
# Seed: 420
merge_method: della
base_model: B:\24B\!models--anthracite-core--Mistral-Small-3.2-24B-Instruct-2506-Text-Only
parameters:
  lambda: 1.0
  normalize: false
  int8_mask: false
dtype: float32
out_dtype: bfloat16
tokenizer:
  source: union
chat_template: auto
name: 👺 Asmodeus-24B-v1
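
To give a feel for what `density: 0.666` and `epsilon: 0.333` do in the config above, here is a minimal Python sketch of magnitude-ranked stochastic pruning in the spirit of the DELLA paper's pruning step. It is not mergekit's implementation. The function names `keep_window` and `magprune_sketch` are hypothetical, and the sketch assumes that keep probabilities span `density - epsilon` to `density + epsilon`, rise linearly with magnitude rank, and that surviving deltas are rescaled by the inverse of their keep probability.

```python
import random


def keep_window(density, epsilon):
    """Return the assumed (lowest, highest) keep probability."""
    return density - epsilon, density + epsilon


def magprune_sketch(delta, density, epsilon, seed=0):
    """Stochastically prune a list of delta values (fine-tune minus base).

    Larger-magnitude deltas get a higher keep probability. Survivors are
    divided by their keep probability so the expected value is unchanged.
    """
    rng = random.Random(seed)
    n = len(delta)
    low, high = keep_window(density, epsilon)
    # Indices ordered from smallest to largest magnitude.
    order = sorted(range(n), key=lambda i: abs(delta[i]))
    pruned = [0.0] * n
    for rank, i in enumerate(order):
        frac = rank / (n - 1) if n > 1 else 0.5
        # Linear interpolation across the window, clamped to [0, 1].
        keep_p = min(1.0, max(0.0, low + (high - low) * frac))
        if keep_p > 0.0 and rng.random() < keep_p:
            pruned[i] = delta[i] / keep_p
    return pruned


# Values taken from the config above; the delta list is made-up toy data.
low, high = keep_window(0.666, 0.333)
toy_delta = [0.05, -0.9, 0.3, -0.01, 0.6]
toy_pruned = magprune_sketch(toy_delta, 0.666, 0.333, seed=420)
```

Under these assumptions the config's values give a keep-probability window of roughly 0.333 to 0.999, which stays inside the valid 0 to 1 range. The seed here only makes the toy run repeatable and is not claimed to reproduce the merge.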