Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols Paper • 2510.09462 • Published Oct 10, 2025 • 6 • 2
Capability-Based Scaling Laws for LLM Red-Teaming Paper • 2505.20162 • Published May 26, 2025 • 4 • 2