Naphula commited on
Commit
d1383fd
·
verified ·
1 Parent(s): 26234e3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ tags:
5
+ - merge
6
+ - mergekit
7
+ - mistral
8
+ - nemo
9
+ - model_stock
10
+ base_model:
11
+ - mistralai/Mistral-Nemo-Instruct-2407
12
+ - LatitudeGames/Muse-12B
13
+ - allura-org/Tlacuilo-12B
14
+ ---
15
+
16
+ # 🐈 Musecuilo 12B Model_Stock
17
+
18
+ ![Musecuilo](https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/vzBGv8Vlx6n2mAXbWY84S.png)
19
+
20
+ > [!NOTE]
21
+ > <span style="color:red; font-weight:bold">Note:</span> Use **Mistral Tekken** (recommended) or **ChatML** chat template for best results. The model has some refusals but can be jailbroken or ablated as needed.
22
+ >
23
+
24
+ This model was merged using the [`model_stock`](https://arxiv.org/abs/2403.19522) merge method.
25
+
26
+ Musecuilo is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
27
+ * [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407)
28
+ * [LatitudeGames/Muse-12B](https://huggingface.co/LatitudeGames/Muse-12B)
29
+ * [allura-org/Tlacuilo-12B](https://huggingface.co/allura-org/Tlacuilo-12BS)
30
+
31
+ ## 🧩 Configuration
32
+
33
+ ```yaml
34
+ architecture: MistralForCausalLM
35
+ base_model: B:/12B/mistralai--Mistral-Nemo-Instruct-2407
36
+ models:
37
+ - model: B:/12B/allura-org--Tlacuilo-12B
38
+ - model: B:/12B/LatitudeGames--Muse-12B
39
+ merge_method: model_stock
40
+ parameters:
41
+ filter_wise: true
42
+ dtype: float32
43
+ out_dtype: bfloat16
44
+ tokenizer:
45
+ source: B:/12B/LatitudeGames--Muse-12B
46
+ name: Musecuilo-12B-Model_Stock
47
+ ```