# 🌌 Magistaroth 24B v1.1
# Merge Method
A custom merge method named `pdq` was invented for this model. Instead of using its own YAML, it acts as a post-merge processor applied directly to the merged model produced by the [original yaml](https://huggingface.co/DarkArtsForge/Magistaroth-24B-v1/raw/main/mergekit_config.yml). `pdq` aims to enhance creativity by re-scanning the original donor models, encouraging them to explore the 'dark matter' regions of the weight vectors and synergistically augment the merged base with more unique novelty. For **Magistaroth v1.1**, I tested both orderings: the v1 `Della → PDQ → MPOA` pipeline and `Della → MPOA → PDQ`.
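
The exact math behind `pdq` isn't published, so the following is purely a hypothetical illustration of what a post-merge processor in this spirit could look like: for one weight tensor, take the donor's residual relative to the merged weights, discard the part already aligned with the merge, and add back a scaled slice of the orthogonal remainder. The function name, the orthogonal-residual heuristic, and the `scale` parameter are all assumptions, not the real `pdq`:

```python
import numpy as np

def pdq_like_step(merged: np.ndarray, donor: np.ndarray, scale: float = 0.1) -> np.ndarray:
    """Hypothetical sketch of a post-merge augmentation step (NOT the real pdq).

    Adds back a scaled portion of the donor residual that is orthogonal to the
    merged weights -- a stand-in for the 'dark matter' intuition above.
    """
    delta = donor - merged                    # what the donor has that the merge dropped
    m = merged.ravel()
    d = delta.ravel()
    # Component of the residual already captured by the merged weights
    aligned = (d @ m) / (m @ m + 1e-12) * m
    orthogonal = d - aligned                  # the part the merge never explored
    return merged + scale * orthogonal.reshape(merged.shape)
```

In a real pipeline this step would run per-tensor over every donor listed in the mergekit config, after the merge itself has finished.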
It turns out that both orderings are very creative. `MPOA → PDQ` is interesting because it doesn't re-introduce any refusals; however, `PDQ → MPOA` is much smarter, and the difference in the Q0 bench reflects this (9451 vs 12648). `Scale 1.2` was the ablation threshold required to disable refusals. The result is the most creative, detailed, and uncensored variant of the configurations tested.
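
For context, refusal ablation is commonly implemented by removing each weight row's projection onto a learned "refusal direction", with a scale above 1.0 slightly over-removing it. A minimal numpy sketch, assuming that is roughly what `Scale 1.2` controls here (the direction itself and the per-layer details are illustrative):

```python
import numpy as np

def ablate_direction(W: np.ndarray, refusal_dir: np.ndarray, scale: float = 1.2) -> np.ndarray:
    """Subtract `scale` times the projection of each row of W onto a unit
    'refusal' direction. scale=1.0 removes the component exactly; 1.2 overshoots."""
    r = refusal_dir / np.linalg.norm(refusal_dir)
    return W - scale * np.outer(W @ r, r)
```

With `scale=1.0` the weights become exactly orthogonal to the direction; `1.2` flips a small negative component into them, which is one plausible reading of why that threshold was needed to fully disable refusals.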
### Bugs
There is a small risk of increased artifacts (missing spaces, misspelled or repeated words) because `pdq` pushes the limits of what's possible with transformers. These are rare and can be edited out if needed.
### Fully Uncensored
An unablated PDQ version was also tested (it has refusals), but the ablated versions seem to be more popular, so I'm only releasing this one for now.
### Settings
- Recommended `temp 1.0` and `topnsigma 1.25`
- `Mistral Tekken` chat template
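
Assuming `topnsigma` refers to the top-nσ sampler (keep only logits within n standard deviations of the maximum, then sample from the rest at the chosen temperature), the filtering step can be sketched as:

```python
import numpy as np

def top_n_sigma_filter(logits: np.ndarray, n: float = 1.25) -> np.ndarray:
    """Mask every logit further than n standard deviations below the max.
    Sampling then proceeds with softmax over the surviving logits."""
    threshold = logits.max() - n * logits.std()
    return np.where(logits >= threshold, logits, -np.inf)
```

At `temp 1.0` the surviving logits are used as-is; lowering `n` below 1.25 narrows the candidate set further.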