Added Ministral 3 tip to enable reasoning.
Browse files- index.html +16 -11
index.html
CHANGED
|
@@ -86,7 +86,7 @@
|
|
| 86 |
<li class="⚙️">Top_nSigma 1.25 (this sampler enable better higher temperature and creativity, without the drawbacks)</li>
|
| 87 |
<li class="⚙️">Since I'm usually using LM Studio as a backend, I've not played much with the excellent XTC, DRY and nSignma samplers.</li>
|
| 88 |
|
| 89 |
-
<li class="🤖"><emo>🍺</emo> Most universal <b>context & instruct templates</b>: ChatML, or ChatML
|
| 90 |
<li class="⚙️">Very old models can also be used with the Alpaca template.</li>
|
| 91 |
<li class="⚙️">For most recent models, connecting to your backend via "Chat Completion API" removes the need to select a template.</li>
|
| 92 |
|
|
@@ -293,7 +293,7 @@
|
|
| 293 |
<li class="🏢" id="🏢Xiaomi"><i>Xiaomi</i><flag>🇨🇳</flag></li>
|
| 294 |
|
| 295 |
<li class="🤖" id="🤖MiMo 2 Flash"><b>MiMo 2 Flash</b></li>
|
| 296 |
-
<li class="⚖️ 👎"><b>Clueless at Roleplay</b> <emo>😵</emo> The model was clearly not designed
|
| 297 |
<li class="⚙️">Temperature 0.8</li>
|
| 298 |
<li class="⚙️">Top_P 0.95</li>
|
| 299 |
<li class="⚙️">You'll have to use Chat Completion API to connect</li>
|
|
@@ -325,25 +325,31 @@
|
|
| 325 |
<li class="▶️▶️ ⚙️">Temperature 0.15 or 0.1</li>
|
| 326 |
<li class="▶️▶️ ⚙️">Top_P 1.0</li>
|
| 327 |
<li class="▶️"><emo>💭</emo> Reasoning usage</li>
|
| 328 |
-
<li class="▶️▶️ ⚙️">Temperature 0.
|
| 329 |
<li class="▶️▶️ ⚙️">Top_P 0.95</li>
|
| 330 |
<li class="🍺"><emo>💭</emo> Reasoning Formatting: [THINK] [/THINK]</li>
|
| 331 |
-
<li class="🍺"><emo>🍺</emo>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 332 |
|
| 333 |
-
<li class="🍺"><emo>🍺</emo> If using Chat
|
| 334 |
<li class="🍺"><emo>🍺</emo> Ramble way too much? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
|
| 335 |
-
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
|
| 336 |
<li class="💥"><emo>🔞</emo><emo>💥</emo> No need to use a jailbreak prompt, the model is already extremely horny by default!</li>
|
| 337 |
|
| 338 |
<li class="🤖" id="🤖Mistral Large"><b>Mistral Large</b></li>
|
| 339 |
<li class="⚙️">Temperature 0.7</li>
|
| 340 |
<li class="⚙️">Do not use quantize KV cache</li>
|
| 341 |
-
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
|
| 342 |
|
| 343 |
<li class="🤖" id="🤖Mistral Small 3.x"><b>Mistral Small 3.x</b></li>
|
|
|
|
| 344 |
<li class="⚙️">Temperature 0.15</li>
|
| 345 |
<li class="🍺"><emo>🍺</emo> Ramble too much? Too verbose? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
|
| 346 |
-
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
|
| 347 |
|
| 348 |
<li class="🤖" id="🤖Mistral Small 4"><b>Mistral Small 4</b></li>
|
| 349 |
<li class="🍺">To enable reasoning, you need to connect via Chat Completion API</li>
|
|
@@ -353,8 +359,8 @@
|
|
| 353 |
<li class="▶️"><emo>💭</emo> Reasoning usage </li>
|
| 354 |
<li class="▶️▶️ ⚙️">Temperature 0.7</li>
|
| 355 |
<li class="▶️▶️ ⚙️">Reasoning_Effort High</li>
|
| 356 |
-
<li class="
|
| 357 |
-
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template</li>
|
| 358 |
|
| 359 |
<li class="🏢" id="🏢Nvidia"><i>Nvidia</i><flag>🇺🇸</flag></li>
|
| 360 |
|
|
@@ -384,7 +390,6 @@
|
|
| 384 |
<li class="⚙️">Temperature 0.6</li>
|
| 385 |
<li class="⚙️">Top_P 0.95</li>
|
| 386 |
<li class="⚙️">Only support 'Chat Completion API'</li>
|
| 387 |
-
<li class="⚙️"><emo>💭</emo> Reasoning formatting: <think></think></li>
|
| 388 |
<li class="🔞"><emo>🔞</emo><emo>💥</emo> Disabling <emo>💭</emo>Reasoning prevents hard refusals, but decrease realism.</li>
|
| 389 |
|
| 390 |
<li class="🏢" id="🏢Microsoft"><i>Microsoft</i><flag>🇺🇸</flag></li>
|
|
|
|
| 86 |
<li class="⚙️">Top_nSigma 1.25 (this sampler enable better higher temperature and creativity, without the drawbacks)</li>
|
| 87 |
<li class="⚙️">Since I'm usually using LM Studio as a backend, I've not played much with the excellent XTC, DRY and nSignma samplers.</li>
|
| 88 |
|
| 89 |
+
<li class="🤖"><emo>🍺</emo> Most universal <b>context & instruct templates</b>: ChatML, or ChatML Reasoning </li>
|
| 90 |
<li class="⚙️">Very old models can also be used with the Alpaca template.</li>
|
| 91 |
<li class="⚙️">For most recent models, connecting to your backend via "Chat Completion API" removes the need to select a template.</li>
|
| 92 |
|
|
|
|
| 293 |
<li class="🏢" id="🏢Xiaomi"><i>Xiaomi</i><flag>🇨🇳</flag></li>
|
| 294 |
|
| 295 |
<li class="🤖" id="🤖MiMo 2 Flash"><b>MiMo 2 Flash</b></li>
|
| 296 |
+
<li class="⚖️ 👎"><b>Clueless at Roleplay</b> <emo>😵</emo> The model was clearly not designed with roleplay in mind. At least with the Q2_K_XL version I tested locally, responses were unatural, prone to looping, and emotionaly flat.</li>
|
| 297 |
<li class="⚙️">Temperature 0.8</li>
|
| 298 |
<li class="⚙️">Top_P 0.95</li>
|
| 299 |
<li class="⚙️">You'll have to use Chat Completion API to connect</li>
|
|
|
|
| 325 |
<li class="▶️▶️ ⚙️">Temperature 0.15 or 0.1</li>
|
| 326 |
<li class="▶️▶️ ⚙️">Top_P 1.0</li>
|
| 327 |
<li class="▶️"><emo>💭</emo> Reasoning usage</li>
|
| 328 |
+
<li class="▶️▶️ ⚙️">Temperature 0.7</li>
|
| 329 |
<li class="▶️▶️ ⚙️">Top_P 0.95</li>
|
| 330 |
<li class="🍺"><emo>💭</emo> Reasoning Formatting: [THINK] [/THINK]</li>
|
| 331 |
+
<li class="🍺"><emo>🍺</emo> To trigger reasonning in SillyTavern: 'Start replies with' <em>[THINK]</em> and add the following to your prompt:<br>
|
| 332 |
+
<pre style="white-space: inherit;"><s>[SYSTEM_PROMPT]# HOW YOU SHOULD THINK AND ANSWER
|
| 333 |
+
|
| 334 |
+
First draft your thinking process (inner monologue) until you arrive at a response. Format your response using Markdown, and use LaTeX for any mathematical equations. Write both your thoughts and the response in the same language as the input.
|
| 335 |
+
|
| 336 |
+
Your thinking process must follow the template below:[THINK]Your thoughts or/and draft, like working through an exercise on scratch paper. Be as casual and as long as you want until you are confident to generate the response to the user.[/THINK]Here, provide a self-contained response.[/SYSTEM_PROMPT][INST]What is 1+1?[/INST]2<s>[INST]What is 2+2?[/INST]</pre></li>
|
| 337 |
|
| 338 |
+
<li class="🍺"><emo>🍺</emo> If using Chat Completion API, set "Prompt Post-Processing" to Strict</li>
|
| 339 |
<li class="🍺"><emo>🍺</emo> Ramble way too much? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
|
| 340 |
+
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
|
| 341 |
<li class="💥"><emo>🔞</emo><emo>💥</emo> No need to use a jailbreak prompt, the model is already extremely horny by default!</li>
|
| 342 |
|
| 343 |
<li class="🤖" id="🤖Mistral Large"><b>Mistral Large</b></li>
|
| 344 |
<li class="⚙️">Temperature 0.7</li>
|
| 345 |
<li class="⚙️">Do not use quantize KV cache</li>
|
| 346 |
+
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
|
| 347 |
|
| 348 |
<li class="🤖" id="🤖Mistral Small 3.x"><b>Mistral Small 3.x</b></li>
|
| 349 |
+
<li class="⚖️ 👎"><b>Verbose</b> <emo>🗣️</emo> Models based on Mistral 3.1 and 3.2 tends to write walls of text.</li>
|
| 350 |
<li class="⚙️">Temperature 0.15</li>
|
| 351 |
<li class="🍺"><emo>🍺</emo> Ramble too much? Too verbose? Use <a href="https://huggingface.co/sleepdeprived3/Mistral-V7-Tekken-Concise">Mistral-V7-Tekken-Concise prompt</a></li>
|
| 352 |
+
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
|
| 353 |
|
| 354 |
<li class="🤖" id="🤖Mistral Small 4"><b>Mistral Small 4</b></li>
|
| 355 |
<li class="🍺">To enable reasoning, you need to connect via Chat Completion API</li>
|
|
|
|
| 359 |
<li class="▶️"><emo>💭</emo> Reasoning usage </li>
|
| 360 |
<li class="▶️▶️ ⚙️">Temperature 0.7</li>
|
| 361 |
<li class="▶️▶️ ⚙️">Reasoning_Effort High</li>
|
| 362 |
+
<li class="▶️▶️ ⚙️"> Reasoning Formatting: [THINK] [/THINK]</li>
|
| 363 |
+
<li class="🍺"><emo>🍺</emo> Guide to selecting the correct Mistral template 👇</li>
|
| 364 |
|
| 365 |
<li class="🏢" id="🏢Nvidia"><i>Nvidia</i><flag>🇺🇸</flag></li>
|
| 366 |
|
|
|
|
| 390 |
<li class="⚙️">Temperature 0.6</li>
|
| 391 |
<li class="⚙️">Top_P 0.95</li>
|
| 392 |
<li class="⚙️">Only support 'Chat Completion API'</li>
|
|
|
|
| 393 |
<li class="🔞"><emo>🔞</emo><emo>💥</emo> Disabling <emo>💭</emo>Reasoning prevents hard refusals, but decrease realism.</li>
|
| 394 |
|
| 395 |
<li class="🏢" id="🏢Microsoft"><i>Microsoft</i><flag>🇺🇸</flag></li>
|