VizorZ0042 commited on
Commit
2959e40
·
verified ·
1 Parent(s): 82bd11a

More important changes to README.md

Browse files

Added more info to V1, V2, Temperature+Top-K, Repetition Penalty, Top-A, Repetition Slope, TFS and Additional recommendations.

Files changed (1) hide show
  1. README.md +127 -21
README.md CHANGED
@@ -54,6 +54,93 @@ ContextShift should be disabled.
54
 
55
  Flash Attention should be disalbed.
56
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
57
  # General and simplified info about specific sampler parameters:
58
  <details><summary></summary>
59
  <details><summary>Temperature:</summary>
@@ -157,27 +244,41 @@ Probability - the chance for Threshold to cutoff the desired most likely tokens.
157
  <details><summary>V1 **-CREATIVE-BALANCED-**</summary>
158
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF1.png" style="float:right; width:200px; height:300px; padding:10px;">
159
 
160
- Very fine with very good creativity, level of details, emotional connections.
161
 
162
- Might confuse things like character names, species and etc (due to high Temperature and not low enough Top-A). **In such cases lower Repetition Penalty to 1.02612**
163
 
164
- Top-A 0.043725 might work well only with Repetition Penalty 1.02612, but better with **V2**
 
 
 
 
165
  </details>
166
  <details><summary>V2 **-INSANE-DETAIL/ATTENTION-**</summary>
167
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF2.png" style="float:right; width:200px; height:300px; padding:10px;">
168
 
169
  OVERKILL.
170
 
171
- Maintains insane amount of details, attention, accuracy, and length.
172
 
173
- Top-K 206 might improve attention to details, but not generally recommended for Temperature 4.8, mostly due to more focused output on certain things, followed by transitions, which lowers the cases with better creativity and some supposed variations in one output.
174
 
175
- Lower Repetition Penalty to get even more insane attention and accuracy, but sacrifice a bit of creativity.
 
 
 
 
 
 
176
  </details>
177
  <details><summary>V2-Alt **-INSANE-DETAIL/ATTENTION-ASSISTANT-**</summary>
178
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF3.png" style="float:right; width:200px; height:300px; padding:10px;">
179
 
180
  Insane for maximum accuracy for ASSISTANT tasks; can be used with overly complexive RP scenarios (with favor of attention). Will be less creative in favor of attention.
 
 
 
 
181
  </details>
182
 
183
  To preserve the versatility, I would like to describe complete and specific sampler values below **Additional fine-tuning:**, to aim perfection for any case.
@@ -201,6 +302,8 @@ Lower temperature will be output less "exciting" and creative results (like less
201
 
202
  Higher Top-K will expand the attention to smaller details, and preserve attention to multiple simultaneous events, and also can fix smaller text-related issues (like with quotation marks, asterisks, hyphens and etc.)
203
 
 
 
204
  Top-K 206 is the more attentive one, which fits more with assistant tasks, as it will take away some of creativity, so might not be suitable for Temperature 4.8.
205
 
206
  Top-K 134 is the middle-balanced one, with better creativity, good level of detail and fine transitions. Recommeded one for in-character actions and strong roleplay scenarios.
@@ -212,21 +315,21 @@ Temperature 4.8 with Top-K 206 and TFS 0.8413 will output more creative, emotion
212
  Temperature 2.4 with Top-K 206 and TFS 0.8413 will output more attentive results, with less variety, emotions and "surprising" moments.
213
 
214
  Temperature 1.2 with Top-K 206 and TFS 0.8413 will output slightly more attentive results, with even less variety, emotions and "surprising" moments.
 
 
215
  </details>
216
  <details><summary>Repetition Penalty:</summary>
217
- Base value: 1.12082, which will output more creative, emotional, varied, smart and "exciting" results. But tends to have issues with asterisks and quotation marks. Good as an **assistant**. Similar to 1.02612, but with more creativity, less descriptions, faster pace, more interesting responses across multiple characters simultaneously.
218
-
219
- 1.121: an alternative variant of 1.12082, which might fix issues with overly descriptive outputs, but 1.12082 is preferred and context and/or initial input (first) should be adjusted instead.
220
 
221
  1.105 (untuned): specific value I found out during experimentation. Will output less "exciting" results, but fairly better compared to 1.05.
222
 
223
- 1.05 (untuned, not recommended): base value, which is widely used in various LLMs. Will output more casual values, but also omit issues with asterisks and quotation marks.
224
 
225
  1.02612: very specific one; will output very descriptive, attentive and expanded results. Will try to pay attention to noticeably more things compared to other variants. Will preserve character details and much more things as events go by. Great as an **assistant.**. Great for very complex instructions, very complex character cards and complex scenes. Great attention to multiple characters.
226
 
227
- **Other values (might output unstable results):**
228
 
229
- Feel free to experiment with and output any results.
230
 
231
  1.02665: similar to 1.02695, but slightly more altered creativity, with interesting developement of events, more realistic responses from other characters; no issues so far; closest to the 1.02612 one; haven't been tested thoroughly.
232
 
@@ -238,6 +341,9 @@ Feel free to experiment with and output any results.
238
 
239
  1.0285/1.0286: will provide very interesting and creative responses, but will mess up some character details (mostly the ones that already in LLM's database)
240
  Some of them will output inconsistency with specific character parts, like body type, skin type and etc; also missing out certain details and skipping some important parts, be sure to include that and select the best one.
 
 
 
241
  </details>
242
  <details><summary>Top-P:</summary>
243
  Base value: 0.915, which will output very attentive, consistent and stable results. Recommeded for all cases.
@@ -248,16 +354,16 @@ Base value: 0.915, which will output very attentive, consistent and stable resul
248
  </details>
249
  <details><summary>Top-A:</summary>
250
  Base value: 0.07, which will output extremely consistent, stable results, with smooth transitions and very good attention to most things. Recommeded for all cases.
251
- </details>
252
  <details><summary>Other values (experimental):</summary>
253
- 0.043725: more attention to anatomy, but unstable and tends to be unpredictable. Works well with Repetition Penalty 1.02612
254
 
255
- 0.2025: more creative, descriptive, "exciting" and emotional, but tends to skip some details and be slightly more stable compared to 0.045. Less accurate and prone to have more issues with such high Temperature.
 
256
  </details>
257
  <details><summary>Repetition Range:</summary>
258
  Base value: 64, which is the least one that will output overall better, consistant and descriptive results. Recommeded for all cases.
259
 
260
- 128: will output more descriptive results, but tends to be repetitive. Not recommended.
261
  </details>
262
  <details><summary>Seed:</summary>
263
  Use fixed seed to improve consistency alot.
@@ -277,19 +383,19 @@ Feel free to use these fixed values:
277
  132458
278
  </details>
279
  <details><summary>Repetition Slope:</summary>
280
- Base value: 1.12, which will help to max out most things like consistency, level of detail and etc. Recommeded for all cases. Higher values won't output any gain.
281
 
282
- All values below 1 are unstable and will cause issues.
283
  </details>
284
  <details><summary>TFS:</summary>
285
  Base value: 0.8413, which will output very smart, attentive and smooth outputs. Recommeded for all cases.
286
 
287
- 0.9551: will output extremely descriptive and attentive results; the best one as an **assistant**; might cause issues in some LLMs, specifically overly descriptive and attentive results.
288
  </details>
289
  <details><summary>(Additional) recommendations for assistant mode+other tips:</summary>
290
- Disable "Inject Chatnames" to get best results for a personal assistant.
291
 
292
- Usage of "Separate End Tags" might improve the cases with repetitions and make responses smarter and overall better for some models.
293
  </details>
294
  </details>
295
  </details>
 
54
 
55
  Flash Attention should be disalbed.
56
 
57
+ ><summary>Smoothing Factor:</summary>
58
+ Avoid in any case.
59
+
60
+ Increases stability of outputs with very randomized settings, but will cause incorrect choices later, and will cause issues with strict settings, like nonsensical choices, actions, events and etc (even with low values like 0.002).
61
+ </details>
62
+ <details><summary>Mirostat:</summary>
63
+ Helps to steady the temperature (might replace) and dynamically adjusts the effective temperature based on more "surprising" tokens.
64
+
65
+ Tau helps to adjust the diversity; higher - more diverse and creative; lower - more deterministic.
66
+
67
+ Eta helps to increase the frequency of temperature updates; smaller - stable and slow; higher - faster but less coherent.
68
+
69
+ Never managed to get stable enough results for very long period of progression, so I personally avoid it and use strict settings instead.
70
+ </details>
71
+ <details><summary>Smoothing Curve (new):</summary>
72
+ Dynamically adjusts the Penalty, temperature, probability to avoid sudden changes; 1 and lower.
73
+
74
+ Has stronger effect with higher Repetition Penalty.
75
+
76
+ Avoid with strict settings.
77
+
78
+ Values below 0.96 are not recommended.
79
+ </details>
80
+ <details><summary>Adaptive-P (new):</summary>
81
+ Does not work with high Temperature; Avoid at any case.
82
+ </details>
83
+ <details><summary>DRY:</summary>
84
+ Avoid in any case.
85
+
86
+ Helps to steady any repetitions, but never managed to get stable enough outputs.
87
+ </details>
88
+ <details><summary>XTC:</summary>
89
+ Threshold - helps to cutoff the high-probability tokens (most likely), which mostly helps with lower Temperature.
90
+
91
+ Probability - the chance for Threshold to cutoff the desired most likely tokens.
92
+ </details>
93
+ </details>
94
+
95
+ # Ready-to-use AIO general settings:
96
+ <details><summary></summary>
97
+ <details><summary>V1 **-CREATIVE-BALANCED-**</summary>
98
+ <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF1.png" style="float:right; width:200px; height:300px; padding:10px;">
99
+
100
+ Very fine with very good creativity, level of details, emotional connections.
101
+
102
+ Might confuse things like character names, species and etc (due to high Temperature and not low enough Top-A). **In such cases lower Repetition Penalty to 1.02612**
103
+
104
+ Top-A 0.043725 might work well only with Repetition Penalty 1.02612, but better with **V2**
105
+ </details>
106
+ <details><summary>V2 **-INSANE-DETAIL/ATTENTION-**</summary>
107
+ <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF2.png" style="float:right; width:200px; height:300px; padding:10px;">
108
+
109
+ OVERKILL.
110
+
111
+ Maintains insane amount of details, attention, accuracy, and length.
112
+
113
+ Top-K 206 might improve attention to details, but not generally recommended for Temperature 4.8, mostly due to more focused output on certain things, followed by transitions, which lowers the cases with better creativity and some supposed variations in one output.
114
+
115
+ Lower Repetition Penalty to get even more insane attention and accuracy, but sacrifice a bit of creativity.
116
+ </details>
117
+ <details><summary>V2-Alt **-INSANE-DETAIL/ATTENTION-ASSISTANT-**</summary>
118
+ <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF3.png" style="float:right; width:200px; height:300px; padding:10px;">
119
+
120
+ Insane for maximum accuracy for ASSISTANT tasks; can be used with overly complexive RP scenarios (with favor of attention). Will be less creative in favor of attention.
121
+ </details>
122
+
123
+ To preserve the versatility, I would like to describe complete and specific sampler values below **Additional fine-tuning:**, to aim perfection for any case.
124
+
125
+ Here I will describe additional values for special cases, as well as dependencies across different sampler parameters (Like Temperature+Top-K)
126
+ <details><summary>Additional fine-tuning:</summary>
127
+ <details><summary>Temperature+Top-K (only higher values):</summary>
128
+ Temperature is related to Top-K, and in order to achieve perfect part for this specific parameters, both of these needs to be adjusted.
129
+
130
+ For example Temperature 2.4 **needs** to have at least Top-K 134, increased by 72.
131
+
132
+ **Acceptable values of Top-K for Temperature 2.4/4.8**: 134, 206.
133
+
134
+ Different values will cause inconsistency, instability and other issues.
135
+
136
+ **Acceptable values of Temperature for Top-K**: 2.4, 4.8.
137
+
138
+ (1.2) is not prioritized mostly because in most cases it will output unsatisfactory results (e.g. bland and boring).
139
+
140
+ Lower temperature will be output less "exciting" and creative results (like less emotions, variety and predictability by one output), and might trigger repetitions, which can be mostly fixed by raising Top-K.
141
+
142
+ Higher Top-K will expand the attention to smaller details, and preserve attention to multiple simultaneous events, and also can fix smaller text-related issues (like with quotation marks, asterisks, hyphens and etc.)
143
+
144
  # General and simplified info about specific sampler parameters:
145
  <details><summary></summary>
146
  <details><summary>Temperature:</summary>
 
244
  <details><summary>V1 **-CREATIVE-BALANCED-**</summary>
245
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF1.png" style="float:right; width:200px; height:300px; padding:10px;">
246
 
247
+ Very fine with very good creativity, level of detail, emotional connections, "surprising" outcomes and descriptions.
248
 
249
+ In some cases where Repetition Penalty 1.12082 outputs overly descriptive results, try out 1.121, it might provide better results and fix such issues; but in most cases adjusting the typos and logical inconsistency from instructions or character cards will fix such issues, so 1.121 is generally not recommended.
250
 
251
+ Might confuse things like character names, species, User actions and etc (due to high Temperature alongside with not low enough Top-A). **In such cases lower Repetition Penalty to 1.02612; this will fix most issues and alter the outputs in a more focused and less creative way, with attention to finer details**.
252
+
253
+ In this case, Top-K 206 with Repetition Penalty 1.02612 will improve attention to finer details with less attention to creative things and slower progression. **Top-K 134 (with Repetition Penalty 1.02612) is generally more preferred due to being more subtle, with faster progression (good in most cases) and better creative outcomes**.
254
+
255
+ In some specific models (most likely), TFS 0.9551 might improve things even further, with more attention, creativity, and performance overall; do not use if you notice overly long descriptions (extremely long after each section).
256
  </details>
257
  <details><summary>V2 **-INSANE-DETAIL/ATTENTION-**</summary>
258
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF2.png" style="float:right; width:200px; height:300px; padding:10px;">
259
 
260
  OVERKILL.
261
 
262
+ **RECOMENDATIONS:** use only if you want insane level of detail and attention; V1 is more preferred for general role play and good reactions, emotional connections, creativity and shorter outputs with well-preserved details. Use V2 only with very complex cases, or if you want extreme level of detail with very high attention and slow transisitons during certain events, as well as attention to very fine descriptions.
263
 
264
+ Maintains insane amount of details, attention, accuracy, and length: focused outputs, "surprising" outcomes and descriptions (noticeably (in some models) less compared to Top-A 0.07 but still generally good).
265
 
266
+ Lower Repetition Penalty to get even more insane attention and level of detail, but sacrifice a bit of creativity, "surprising" outcomes and descriptions (especially with 1.02612).
267
+
268
+ Repetition Penalty 1.02612 is preferred as the lowest point; will output very attentive and detailed descriptions, with other things described earlier.
269
+
270
+ In this case, Top-K 206 will increase attention even further.
271
+
272
+ Use Top-K 134 for faster transitions and attention to more "surprising" moments.
273
  </details>
274
  <details><summary>V2-Alt **-INSANE-DETAIL/ATTENTION-ASSISTANT-**</summary>
275
  <img src="https://gitlab.com/Azuro721/trueperfect-ai/-/raw/main/PERF3.png" style="float:right; width:200px; height:300px; padding:10px;">
276
 
277
  Insane for maximum accuracy for ASSISTANT tasks; can be used with overly complexive RP scenarios (with favor of attention). Will be less creative in favor of attention.
278
+
279
+ Generally, Top-K 278 might improve attention to details, but not recommended for Temperature 2.4 and other cases, mostly due to increased chance of becoming overly descriptive, which will confuse model and force hallucinations.
280
+
281
+ In rare cases (specific models) with overly descriptive results, decrease TFS to 0.8413 or/and Top-K to 134.
282
  </details>
283
 
284
  To preserve the versatility, I would like to describe complete and specific sampler values below **Additional fine-tuning:**, to aim perfection for any case.
 
302
 
303
  Higher Top-K will expand the attention to smaller details, and preserve attention to multiple simultaneous events, and also can fix smaller text-related issues (like with quotation marks, asterisks, hyphens and etc.)
304
 
305
+ Top-K 278, as described earlier, might cause overly descriptive results, which will most likely cause hallucinations.
306
+
307
  Top-K 206 is the more attentive one, which fits more with assistant tasks, as it will take away some of creativity, so might not be suitable for Temperature 4.8.
308
 
309
  Top-K 134 is the middle-balanced one, with better creativity, good level of detail and fine transitions. Recommeded one for in-character actions and strong roleplay scenarios.
 
315
  Temperature 2.4 with Top-K 206 and TFS 0.8413 will output more attentive results, with less variety, emotions and "surprising" moments.
316
 
317
  Temperature 1.2 with Top-K 206 and TFS 0.8413 will output slightly more attentive results, with even less variety, emotions and "surprising" moments.
318
+
319
+ Further experimentation with Top-K might not be possible, mostly due to logical limit for all settings combined.
320
  </details>
321
  <details><summary>Repetition Penalty:</summary>
322
+ Base value: 1.12082, which will output more creative, emotional, varied, smart and "exciting" results. But tends to have issues with asterisks and quotation marks; similar to 1.02612, but with more creativity, less descriptions, faster pace, but prone to issues if input has logical inconsistencies or lots of typos.
 
 
323
 
324
  1.105 (untuned): specific value I found out during experimentation. Will output less "exciting" results, but fairly better compared to 1.05.
325
 
326
+ 1.05 (untuned, not recommended): base value, which is widely used in various LLMs. Might output focused results with average creativity (better than 1.02612), but prone to be less stable compared to 1.105.
327
 
328
  1.02612: very specific one; will output very descriptive, attentive and expanded results. Will try to pay attention to noticeably more things compared to other variants. Will preserve character details and much more things as events go by. Great as an **assistant.**. Great for very complex instructions, very complex character cards and complex scenes. Great attention to multiple characters.
329
 
330
+ <details><summary>Other values (might output unstable results):</summary>
331
 
332
+ Feel free to experiment with these variants, and show any good results (if stable enough to be used for at least ~6K tokens).
333
 
334
  1.02665: similar to 1.02695, but slightly more altered creativity, with interesting developement of events, more realistic responses from other characters; no issues so far; closest to the 1.02612 one; haven't been tested thoroughly.
335
 
 
341
 
342
  1.0285/1.0286: will provide very interesting and creative responses, but will mess up some character details (mostly the ones that already in LLM's database)
343
  Some of them will output inconsistency with specific character parts, like body type, skin type and etc; also missing out certain details and skipping some important parts, be sure to include that and select the best one.
344
+
345
+ Might output quite stable results if used with Top-A 0.0001.
346
+ </details>
347
  </details>
348
  <details><summary>Top-P:</summary>
349
  Base value: 0.915, which will output very attentive, consistent and stable results. Recommeded for all cases.
 
354
  </details>
355
  <details><summary>Top-A:</summary>
356
  Base value: 0.07, which will output extremely consistent, stable results, with smooth transitions and very good attention to most things. Recommeded for all cases.
 
357
  <details><summary>Other values (experimental):</summary>
358
+ 0.043725: more attention to anatomy, but unstable and tends to be unpredictable. Works better with Repetition Penalty 1.02612 or **V2** but goes bad soon, after several thousand tokens.
359
 
360
+ 0.2025: more creative, descriptive, "exciting" and emotional, but tends to skip some details. Less accurate and have lots of issues with high Temperature; not suitable to be used generally, only to get initial creative inputs.
361
+ </details>
362
  </details>
363
  <details><summary>Repetition Range:</summary>
364
  Base value: 64, which is the least one that will output overall better, consistant and descriptive results. Recommeded for all cases.
365
 
366
+ 128: will output more descriptive results, but tends to be repetitive. Not recommended, but cab be used to adjust initial responses.
367
  </details>
368
  <details><summary>Seed:</summary>
369
  Use fixed seed to improve consistency alot.
 
383
  132458
384
  </details>
385
  <details><summary>Repetition Slope:</summary>
386
+ Base value: 1.12, which will help to max out most things like consistency, level of detail and etc. Recommeded for all cases. Higher values won't affect outputs at all, only might cause issues with slowdowns.
387
 
388
+ All values below 1 are unstable and will cause very random issues.
389
  </details>
390
  <details><summary>TFS:</summary>
391
  Base value: 0.8413, which will output very smart, attentive and smooth outputs. Recommeded for all cases.
392
 
393
+ 0.9551: will output extremely descriptive and attentive results; the best one as an **assistant**; might cause issues in some LLMs as described earlier.
394
  </details>
395
  <details><summary>(Additional) recommendations for assistant mode+other tips:</summary>
396
+ Disable "Inject ChatNames" to get the best results for assistant-related tasks.
397
 
398
+ Usage of "Separate End Tags" might improve the cases with repetitions, make responses smarter and overall better for some models.
399
  </details>
400
  </details>
401
  </details>