Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models Paper • 2601.15220 • Published 5 days ago • 8
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers Paper • 2506.15674 • Published Jun 18, 2025 • 2
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 118