File size: 834 Bytes
b99e47c | 1 2 3 4 5 6 7 8 9 10 11 12 13 | 
Mistral (Non-Tekken), i.e., Mistral v3 + `[SYSTEM_PROMPT]`
Looks like `<think>` doesn't need to be prefilled. You can opt out of reasoning - it's just as good, maybe even better.
No toxic data so you may want to prefill / guide reasoning when dealing with heavy themes (unless your prompt is sufficiently instructed/gaslit to be evil).
Alternatively, since `<think>` is not a special token, you can influence reasoning by phrasing it like `<evil_think>`, `<creative_think>`, or `<spicy_think>`, etc. It's smart enough to close it properly.
Yes, this is how much I want to avoid tuning MoEs.
 |