Disentangling meaning from language in LLM-based machine translation Paper • 2602.04613 • Published Feb 4
Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models Paper • 2602.10382 • Published Feb 12