When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift Paper • 2602.14161 • Published 3 days ago