Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety Paper • 2507.11473 • Published Jul 15, 2025 • 2
Annotating the Chain-of-Thought: A Behavior-Labeled Dataset for AI Safety Paper • 2510.18154 • Published Oct 20, 2025