munish0838 commited on
Commit
59138bb
·
verified ·
1 Parent(s): b1a635f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +232 -0
README.md ADDED
@@ -0,0 +1,232 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ language:
5
+ - en
6
+ license: cc-by-nc-4.0
7
+
8
+ ---
9
+
10
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
11
+
12
+
13
+ # QuantFactory/MN-BackyardAI-Party-12B-v1-GGUF
14
+ This is quantized version of [Sao10K/MN-BackyardAI-Party-12B-v1](https://huggingface.co/Sao10K/MN-BackyardAI-Party-12B-v1) created using llama.cpp
15
+
16
+ # Original Model Card
17
+
18
+
19
+ Trained with compute from [Backyard.ai](https://backyard.ai/) | Thanks to them and **@dynafire** for helping me out.
20
+
21
+ Trained on 2x A100 SXM 40GB as an 8-bit LoRA.
22
+
23
+ ---
24
+
25
+ This is a group-chat based roleplaying model, based off of [12B-Lyra-v4a2](https://huggingface.co/Sao10K/MN-12B-Lyra-v4a2), a variant of Lyra-v4 that is currently private.
26
+
27
+ It is trained on an entirely human-based dataset, based on forum / internet group roleplaying styles. The only augmentation done with LLMs is to the character sheets, to fit to the system prompt, to fit various character sheets within context.
28
+
29
+ This model is still capable of 1 on 1 roleplay, though I recommend using ChatML when doing that instead.
30
+
31
+ ---
32
+
33
+ ![party](https://huggingface.co/Sao10K/MN-BackyardAI-Party-12B-v1/resolve/main/party1.png)
34
+
35
+ ---
36
+
37
+ # Formatting:
38
+
39
+ Training for the multi-character roleplaying format is done with a variant of ChatML, replaced with [INST] blocks formatted as such. Use this to draw in more of the training done.
40
+ ```
41
+ [INST]system
42
+ System Prompt Here[/INST]
43
+ [INST]user
44
+ User's Yapping[/INST]
45
+ [INST]model
46
+ Model Reply[/INST]
47
+ ```
48
+
49
+ **Relevant!**
50
+ <br> \- Turns do not need to respect `user -> model -> user`. Training is done with disjointed turns that may have repeating turns to simulate real group roleplay / chat scenarios with multiple users.
51
+ <br> \- Additional work may be required to fit for your front-end.
52
+ <br> \- Ideally character cards are all included in the turns. Training is done with this in mind. Below on the page has relevant information.
53
+ <br> \- This is a Nemo model, so lower Temperature and a sprinkling of min_p helps.
54
+ <br> \- This does require a lot of tinkering to fit within SillyTavern / other frontends.
55
+
56
+ To get better performance on Regular 1 on 1 Roleplay or Chat scenarios, use ChatML to get more of Lyra's performance.
57
+ ```
58
+ <|im_start|>system
59
+ System Prompt Here.<|im_end|>
60
+ <|im_start|>user
61
+ User's Instructions<|im_end|>
62
+ <|im_start|>assistant
63
+ Model Response<|im_end|>
64
+ ```
65
+
66
+ **For best results, set both `<|im_end|>` and `[INST]` as stopping strings.**
67
+ **Recommended Temperature is <1 , min_p of ateast 0.1**
68
+
69
+ ---
70
+
71
+ # Dataset Information:
72
+
73
+ This dataset is made from a human RP forum source, trimmed down, augmented and reformatted to fit.
74
+ <br> \- Each entry has a minimum of 6 turns to be inside
75
+ <br> \- Number of unique/main characters are ranged from 2 to 7 characters per entry.
76
+ <br> \- Each conversation is kept as is to preserve quality and uniqueness of the human data.
77
+ <br> \- Only the added system prompt makes use of the current character sheets given.
78
+
79
+ The following below is how the current Character Card / Sheets is done, which are augmented from the messy and non-uniform character sheets available. To get best results, please reformat your current character data to the on as seen below, or as similar as you can if possible.
80
+ ```
81
+ - **Character Name**:
82
+ - **Age**:
83
+ - **Race**:
84
+ - **Mageblood Type**: (if applicable)
85
+ - **Favored Magic Class**: (if applicable)
86
+ - **Previous Magic Training**: (if applicable)
87
+ - **Occupation/Profession**: (if applicable)
88
+ - **Appearance**: (if applicable)
89
+ - **Biography**: (if applicable)
90
+ - **Good Attributes**: (if applicable)
91
+ - **Bad Attributes**: (if applicable)
92
+ - **Equipment**: (if applicable)
93
+ - **Other Information**: (if applicable)
94
+ ```
95
+
96
+ Here is an example based on the above format:
97
+
98
+ ```
99
+ **Character Name**: Keri Wolf
100
+ **Age**: 21
101
+ **Race**: Vampire
102
+ **Mageblood Type**: Hydromancy
103
+ **Favored Magic Class**: Aqua
104
+ **Previous Magic Training**: Novice
105
+ **Occupation/Profession**: None specified
106
+
107
+ **Appearance**:
108
+ - Height: 5'9"
109
+ - A wooden wolf necklace around her neck, contrasting with her pale skin
110
+ - Three swords strapped to her waist
111
+ - A tattoo of a thorn vine, her family crest, on her right arm
112
+ - Normal eye color is red but changes based on her mood or the topic of conversation
113
+ - Carries a hunk of wood and a carving knife for personal activities
114
+
115
+ **Biography**:
116
+ Keri Wolf grew up in a family of adopted siblings in Djarkel. She had a normal childhood, with her best friend Satori, and was taught basic self-defense by her father. Her brothers were considered troublemakers but remained close to her. On her 21st birthday, her family was slaughtered by a vampire nest, and she was bitten. This led to her developing vampiric traits and seeking answers at the college.
117
+
118
+ **Good Attributes**:
119
+ - Easy-going
120
+ - Observant
121
+ - Helps those in trouble
122
+ - Soft-hearted
123
+ - Kind
124
+ - Cool-headed
125
+ - Good at getting out of difficult situations
126
+ - Avoids violence
127
+ - Gets along well with different people
128
+ - Loves animals
129
+
130
+ **Bad Attributes**:
131
+ - Sunlight sensitivity
132
+ - Hatred towards vampires outside the college
133
+ - Keeps feelings in check, leading to dangerous outbursts
134
+ - Cruel manner of speaking
135
+ - Thirst for revenge
136
+
137
+ **Equipment**:
138
+ - Wooden wolf necklace
139
+ - Three swords (one engraved with a rose, one engraved with her father's name, and one for decoration)
140
+ - Carving knife
141
+ - Hunk of wood
142
+ - Stealth Ring
143
+ - Knight's Shield
144
+
145
+ **Other Information**:
146
+ - Secret word: rebirth
147
+ ```
148
+
149
+ The following system prompt is augmented from available character sheets, or details from the original dataset. Placeholder names are given as shown.
150
+
151
+ ```
152
+ You are involved in a multi-character internet-style roleplaying session with a human user, who is playing as Ballbuster Steve. Do not generate dialogue for the user's character, Ballbuster Steve. Focus on the other characters.
153
+
154
+ [Human User]
155
+ Ballbuster Steve # {user}
156
+ Character Bio: [Steve's bio]
157
+
158
+ [Involved Characters]
159
+ Altair "Arty" Enzo # {char1}
160
+ Character Bio: [Arty's bio]
161
+ ---
162
+ Sukuna Gojo # {char2}
163
+ Character Bio: [Sukuna's bio]
164
+ ---
165
+
166
+ The roleplay begins now.
167
+ ```
168
+
169
+ This is how some of the turn example looks like, newlines are only for visual use.
170
+
171
+ ```
172
+ [INST]user
173
+ Ballbuster Steve: Being the doorman at a nightclub, especially one as popular as LUSH... [/INST]
174
+
175
+ [INST]model
176
+ Altair "Arty" Enzo: While he was waiting for Jake to answer, Arty noticed from the corner of his eye... [/INST]
177
+
178
+ [INST]model
179
+ Sukuna Gojo: Nick was now out of his element; he just came off his portable radio app... [/INST]
180
+
181
+ [INST]user
182
+ Ballbuster Steve: Steve grabbed his black clutch from where it was stashed under the mixing desk... [/INST]
183
+ ```
184
+
185
+ To make it easier, this is how I'd format responses for the backend:
186
+
187
+ ```
188
+ <s>[INST]system
189
+ {system_prompt}[/INST]
190
+ [INST]user
191
+ {user}: {text}[/INST]
192
+ [INST]model
193
+ {char1}: {text}[/INST]
194
+ [INST]model
195
+ {char2}: {text}[/INST]
196
+ [INST]user
197
+ {user}: {text}[/INST]
198
+ [INST]model
199
+ {char1}: {text}[/INST]<|im_end|> # For Final Turn only. Alternatively, set <|im_end|> as a stopping string.
200
+ ```
201
+
202
+ ---
203
+
204
+ # Current Issues:
205
+ ```
206
+ - Impersonation - This is a common side-effect of pure human roleplaying data, unfortunately.
207
+ Users do like writing the actions of others, though this is more limited to end of reply.
208
+ - Varied Output Quality - A swipe should be enough?
209
+ I only removed obviously bad entries. Output quality varies thanks to the variety of human users involved.
210
+ - Character Detail Confusion when in group chats
211
+ This rarely happens, but it is usually when there are too many main characters, or the bio is improperly formatted and seperated.
212
+ Or if you're using an additional, complex system prompt.
213
+ - Random OOC / Story Break moments may still exist despite me filtering the data.
214
+ - Limited Dataset Size -> 4K Varied Samples ranging from 2-7 characters per entry. I'm looking to expand.
215
+ - Limited System Prompt? -> I'm trying to improve on this.
216
+ - Fantasy-bias? -> Most of the entries are fantasy-based after all.
217
+ ```
218
+
219
+ ---
220
+
221
+ # Training Metrics
222
+
223
+ ```
224
+ n_sample: 4000
225
+ n_gpu: 2
226
+ global batch size: 12
227
+ lora: bnb_8bit
228
+ no. epochs: 3
229
+ lr: 0.000004
230
+ lr_scheduler: cosine
231
+ deepspeed: zero2
232
+ ```