Deepseek V4 Willingness/10
On W/10 refusal and adherence, V4 Flash matches V3.2 exactly for both thinking (4&6.5) and non-thinking (5&9.5), but V4 pro seems low compared to any recent deepseek models.
This reddit post "Deepseek v4 Pro Temps (also might need some preset testers)" (https://www.reddit.com/r/SillyTavernAI/comments/1swkt97/deepseek_v4_pro_temps_also_might_need_some_preset/) talked about temp between 1 to 1.7 giving consistent refusals for a common test but it passes at lower temps, even though deepseek docs say temp doesn't matter for thinking mode. Could this be the reason?
Edit: Not asking retest with low temps or anything. Just wondering why V4 pro scored so low on W/10 when it seemed less restrictive than V3.2 in some other tests, and if it's related to temps.