overhead520 commited on
Commit
2c8ada2
·
verified ·
1 Parent(s): 20b2195

Added Ministral 3 tip to enable reasoning.

Browse files
Files changed (1) hide show
  1. index.html +16 -11
index.html CHANGED
@@ -86,7 +86,7 @@
86
  <li class="⚙️">Top_nSigma 1.25 (this sampler enable better higher temperature and creativity, without the drawbacks)</li>
87
  <li class="⚙️">Since I'm usually using LM Studio as a backend, I've not played much with the excellent XTC, DRY and nSignma samplers.</li>
88
 
89
- <li class="🤖"><emo>🍺</emo> Most universal <b>context &amp; instruct templates</b>: ChatML, or ChatML <emo>💭</emo>Reasoning </li>
90
  <li class="⚙️">Very old models can also be used with the Alpaca template.</li>
91
  <li class="⚙️">For most recent models, connecting to your backend via "Chat Completion API" removes the need to select a template.</li>
92
 
@@ -293,7 +293,7 @@
293
  <li class="🏢" id="🏢Xiaomi"><i>Xiaomi</i><flag>🇨🇳</flag></li>
294
 
295
  <li class="🤖" id="🤖MiMo 2 Flash"><b>MiMo 2 Flash</b></li>
296
- <li class="⚖️ 👎"><b>Clueless at Roleplay</b> <emo>😵</emo> The model was clearly not designed for roleplay. At least with the Q2_K_XL version I tested locally, responses were unatural, prone to looping, and emotionaly flat.</li>
297
  <li class="⚙️">Temperature 0.8</li>
298
  <li class="⚙️">Top_P 0.95</li>
299
  <li class="⚙️">You'll have to use Chat Completion API to connect</li>
@@ -325,25 +325,31 @@
325
  <li class="▶️▶️ ⚙️">Temperature 0.15 or 0.1</li>
326
  <li class="▶️▶️ ⚙️">Top_P 1.0</li>
327
  <li class="▶️"><emo>💭</emo> Reasoning usage</li>
328
- <li class="▶️▶️ ⚙️">Temperature 0.17</li>
329
  <li class="▶️▶️ ⚙️">Top_P 0.95</li>
330
  <li class="🍺"><emo>💭</emo> Reasoning Formatting: [THINK] [/THINK]</li>
331
- <li class="🍺"><emo>🍺</emo> Not sure yet how to trigger reasonning in Silly Tavern</li>
 
 
 
 
 
332
 
333
- <li class="🍺"><emo>🍺</emo> If using Chat Connection API, set "Prompt Post-Processing" to Strict</li>
334
  <li class="🍺"><emo>🍺</emo> Ramble way too much? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
335
- <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
336
  <li class="💥"><emo>🔞</emo><emo>💥</emo> No need to use a jailbreak prompt, the model is already extremely horny by default!</li>
337
 
338
  <li class="🤖" id="🤖Mistral Large"><b>Mistral Large</b></li>
339
  <li class="⚙️">Temperature 0.7</li>
340
  <li class="⚙️">Do not use quantize KV cache</li>
341
- <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
342
 
343
  <li class="🤖" id="🤖Mistral Small 3.x"><b>Mistral Small 3.x</b></li>
 
344
  <li class="⚙️">Temperature 0.15</li>
345
  <li class="🍺"><emo>🍺</emo> Ramble too much? Too verbose? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
346
- <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
347
 
348
  <li class="🤖" id="🤖Mistral Small 4"><b>Mistral Small 4</b></li>
349
  <li class="🍺">To enable reasoning, you need to connect via Chat Completion API</li>
@@ -353,8 +359,8 @@
353
  <li class="▶️"><emo>💭</emo> Reasoning usage </li>
354
  <li class="▶️▶️ ⚙️">Temperature 0.7</li>
355
  <li class="▶️▶️ ⚙️">Reasoning_Effort High</li>
356
- <li class="🍺"><emo>💭</emo> Reasoning Formatting: [THINK] [/THINK]</li>
357
- <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
358
 
359
  <li class="🏢" id="🏢Nvidia"><i>Nvidia</i><flag>🇺🇸</flag></li>
360
 
@@ -384,7 +390,6 @@
384
  <li class="⚙️">Temperature 0.6</li>
385
  <li class="⚙️">Top_P 0.95</li>
386
  <li class="⚙️">Only support 'Chat Completion API'</li>
387
- <li class="⚙️"><emo>💭</emo> Reasoning formatting: &lt;think&gt;&lt;/think&gt;</li>
388
  <li class="🔞"><emo>🔞</emo><emo>💥</emo> Disabling <emo>💭</emo>Reasoning prevents hard refusals, but decrease realism.</li>
389
 
390
  <li class="🏢" id="🏢Microsoft"><i>Microsoft</i><flag>🇺🇸</flag></li>
 
86
  <li class="⚙️">Top_nSigma 1.25 (this sampler enable better higher temperature and creativity, without the drawbacks)</li>
87
  <li class="⚙️">Since I'm usually using LM Studio as a backend, I've not played much with the excellent XTC, DRY and nSignma samplers.</li>
88
 
89
+ <li class="🤖"><emo>🍺</emo> Most universal <b>context &amp; instruct templates</b>: ChatML, or ChatML Reasoning </li>
90
  <li class="⚙️">Very old models can also be used with the Alpaca template.</li>
91
  <li class="⚙️">For most recent models, connecting to your backend via "Chat Completion API" removes the need to select a template.</li>
92
 
 
293
  <li class="🏢" id="🏢Xiaomi"><i>Xiaomi</i><flag>🇨🇳</flag></li>
294
 
295
  <li class="🤖" id="🤖MiMo 2 Flash"><b>MiMo 2 Flash</b></li>
296
+ <li class="⚖️ 👎"><b>Clueless at Roleplay</b> <emo>😵</emo> The model was clearly not designed with roleplay in mind. At least with the Q2_K_XL version I tested locally, responses were unatural, prone to looping, and emotionaly flat.</li>
297
  <li class="⚙️">Temperature 0.8</li>
298
  <li class="⚙️">Top_P 0.95</li>
299
  <li class="⚙️">You'll have to use Chat Completion API to connect</li>
 
325
  <li class="▶️▶️ ⚙️">Temperature 0.15 or 0.1</li>
326
  <li class="▶️▶️ ⚙️">Top_P 1.0</li>
327
  <li class="▶️"><emo>💭</emo> Reasoning usage</li>
328
+ <li class="▶️▶️ ⚙️">Temperature 0.7</li>
329
  <li class="▶️▶️ ⚙️">Top_P 0.95</li>
330
  <li class="🍺"><emo>💭</emo> Reasoning Formatting: [THINK] [/THINK]</li>
331
+ <li class="🍺"><emo>🍺</emo> To trigger reasonning in SillyTavern: 'Start replies with' <em>[THINK]</em> and add the following to your prompt:<br>
332
+ <pre style="white-space: inherit;">&lt;s&gt;[SYSTEM_PROMPT]# HOW YOU SHOULD THINK AND ANSWER
333
+
334
+ First draft your thinking process (inner monologue) until you arrive at a response. Format your response using Markdown, and use LaTeX for any mathematical equations. Write both your thoughts and the response in the same language as the input.
335
+
336
+ Your thinking process must follow the template below:[THINK]Your thoughts or/and draft, like working through an exercise on scratch paper. Be as casual and as long as you want until you are confident to generate the response to the user.[/THINK]Here, provide a self-contained response.[/SYSTEM_PROMPT][INST]What is 1+1?[/INST]2&lt;s&gt;[INST]What is 2+2?[/INST]</pre></li>
337
 
338
+ <li class="🍺"><emo>🍺</emo> If using Chat Completion API, set "Prompt Post-Processing" to Strict</li>
339
  <li class="🍺"><emo>🍺</emo> Ramble way too much? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
340
+ <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
341
  <li class="💥"><emo>🔞</emo><emo>💥</emo> No need to use a jailbreak prompt, the model is already extremely horny by default!</li>
342
 
343
  <li class="🤖" id="🤖Mistral Large"><b>Mistral Large</b></li>
344
  <li class="⚙️">Temperature 0.7</li>
345
  <li class="⚙️">Do not use quantize KV cache</li>
346
+ <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
347
 
348
  <li class="🤖" id="🤖Mistral Small 3.x"><b>Mistral Small 3.x</b></li>
349
+ <li class="⚖️ 👎"><b>Verbose</b> <emo>🗣️</emo> Models based on Mistral 3.1 and 3.2 tends to write walls of text.</li>
350
  <li class="⚙️">Temperature 0.15</li>
351
  <li class="🍺"><emo>🍺</emo> Ramble too much? Too verbose? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
352
+ <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
353
 
354
  <li class="🤖" id="🤖Mistral Small 4"><b>Mistral Small 4</b></li>
355
  <li class="🍺">To enable reasoning, you need to connect via Chat Completion API</li>
 
359
  <li class="▶️"><emo>💭</emo> Reasoning usage </li>
360
  <li class="▶️▶️ ⚙️">Temperature 0.7</li>
361
  <li class="▶️▶️ ⚙️">Reasoning_Effort High</li>
362
+ <li class="▶️▶️ ⚙️"> Reasoning Formatting: [THINK] [/THINK]</li>
363
+ <li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
364
 
365
  <li class="🏢" id="🏢Nvidia"><i>Nvidia</i><flag>🇺🇸</flag></li>
366
 
 
390
  <li class="⚙️">Temperature 0.6</li>
391
  <li class="⚙️">Top_P 0.95</li>
392
  <li class="⚙️">Only support 'Chat Completion API'</li>
 
393
  <li class="🔞"><emo>🔞</emo><emo>💥</emo> Disabling <emo>💭</emo>Reasoning prevents hard refusals, but decrease realism.</li>
394
 
395
  <li class="🏢" id="🏢Microsoft"><i>Microsoft</i><flag>🇺🇸</flag></li>