Update README.md
README.md (CHANGED)
@@ -50,8 +50,9 @@ Because pain is fun, and persistence in design iteration is the only way forward
 --Finalized Logic and Creativity Segments (MK2):


-
-
+After a few key meetings with our top teams of memegineers we drafted Thorns MK2, which was promptly fast-tracked for production by the Roko's Basilisk Shadow Council.
+
+...Actually I just redid the merge like this:


 -Model Merge Ensemble Key-
@@ -59,32 +60,42 @@ Also none of that shit happened, I just redid everything like this:
 ({({NousHermes+Chronos}[Kimiko])+({Platypus+AiroborosM2.0}[Janine])}{Holodeck[LIMA RP]})


+## Findings:
+
+-Strategically fusing LoRAs to the models that stand to gain the most from them, and then merging the result into the ensemble, is exceptionally effective.
+
+-Stacking the exact same LoRAs onto one model and then merging that into the ensemble results in noisy garbage.
+
 ## Language Models and LoRAs Used Credits:

-manticore-30b-chat-pyg-alpha [Epoch0.4] by openaccess-ai-collective
-
-
-
-
-
-
-
-ChanSung's GPT4-Alpaca-LoRA
-https://huggingface.co/chansung/gpt4-alpaca-lora-30b
-
-
-https://huggingface.co/Neko-Institute-of-Science/VicUnLocked-30b-LoRA
-
-Also thanks to Meta for LLaMA.
+
+All models and adapters used are LLaMAv2-13B.
+
+### Models:
+
+Nous-Hermes
+
+Chronos
+
+Platypus
+
+Airoboros
+
+Holodeck
+
+### Adapters:
+
+Kimiko
+
+Janine
+
+LIMA RP
+
+Also thanks to Meta for LLaMAv2 and deciding to allow the research community at large to benefit from their incredible work.

 Each model and LoRA was hand picked and considered for what it could contribute to this ensemble.
 Thanks to each and every one of you for your incredible work developing some of the best things
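
One plausible reading of the ensemble key above: `{A+B}` is a weight-space merge of two checkpoints, and `[X]` is a LoRA fused into the bracketed result before the next merge. Below is a minimal sketch under that reading, as plain state-dict arithmetic. The file names, the 50/50 ratios, the LoRA file format, and the `merge`/`fuse_lora` helpers are all illustrative assumptions; the commit does not name the actual merge tool or weights used.

```python
# Sketch only: assumes every checkpoint shares the LLaMAv2-13B architecture
# and that each LoRA file maps target weight names to (A, B) factor pairs.
import torch

def merge(a: dict, b: dict, alpha: float = 0.5) -> dict:
    """Element-wise linear interpolation of two full state dicts."""
    return {k: alpha * a[k] + (1.0 - alpha) * b[k] for k in a}

def fuse_lora(weights: dict, lora: dict, scale: float = 1.0) -> dict:
    """Bake a LoRA into full weights: W <- W + scale * (B @ A)."""
    out = dict(weights)
    for name, (A, B) in lora.items():
        out[name] = out[name] + scale * (B @ A)
    return out

# ({NousHermes+Chronos}[Kimiko])
left = fuse_lora(merge(torch.load("nous-hermes-13b.pt"),
                       torch.load("chronos-13b.pt")),
                 torch.load("kimiko-lora.pt"))

# ({Platypus+AiroborosM2.0}[Janine])
right = fuse_lora(merge(torch.load("platypus2-13b.pt"),
                        torch.load("airoboros-13b-m2.0.pt")),
                  torch.load("janine-lora.pt"))

# Outer braces: {left+right}, then folded with {Holodeck[LIMA RP]}.
thorns_mk2 = merge(merge(left, right),
                   fuse_lora(torch.load("holodeck-13b.pt"),
                             torch.load("lima-rp-lora.pt")))
torch.save(thorns_mk2, "thorns-mk2.pt")
```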
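On the first finding: one concrete way to fuse a LoRA into a model before it enters the ensemble is Hugging Face peft's `merge_and_unload()`, which folds the adapter's low-rank deltas into the base weights and returns a plain model that can then be merged like any full checkpoint. The repo ID and adapter path below are placeholders, not the exact releases the author used.

```python
# Hedged sketch: fusing one LoRA into one base model with peft.
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "NousResearch/Nous-Hermes-Llama2-13b",  # stand-in base model
    torch_dtype=torch.float16,
)
# Attach the adapter, then bake its deltas into the base weights.
fused = PeftModel.from_pretrained(base, "path/to/kimiko-lora").merge_and_unload()
fused.save_pretrained("nous-hermes-13b-kimiko-fused")
```

Read alongside the second finding, the pattern is: fuse each adapter once, into the model that benefits most, and let the merge distribute it, rather than stacking identical adapters on a single checkpoint.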