File size: 834 Bytes
a3c1708
 
de4a392
 
b958941
ac53ce8
 
 
 
9041ae1
606b9fa
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/il5PmyJOCwkDR_1dwfzHa.png)

Mistral (Non-Tekken), i.e., Mistral v3 + `[SYSTEM_PROMPT]`

Looks like `<think>` doesn't need to be prefilled. You can opt out of reasoning - it's just as good, maybe even better.

No toxic data so you may want to prefill / guide reasoning when dealing with heavy themes (unless your prompt is sufficiently instructed/gaslit to be evil).

Alternatively, since `<think>` is not a special token, you can influence reasoning by phrasing it like `<evil_think>`, `<creative_think>`, or `<spicy_think>`, etc. It's smart enough to close it properly.

Yes, this is how much I want to avoid tuning MoEs.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/Rson4ntOeqeYOTeKodMxH.png)