Activations, linear+MLP refusal probes, WildGuard-judged generations & eval data to fully reproduce amir/RESULTS.md (PhillipHoward/high-temp-refusal).