split-personality / README.md
vincentoh's picture
Link to GitHub repo for reproducible proof
c6f654e
---
title: Split Personality
emoji: πŸ†
colorFrom: indigo
colorTo: blue
sdk: static
pinned: false
license: mit
short_description: Mech Interp research on Attentional Hijacking
linked_repos:
- https://github.com/bigsnarfdude/attentional_hijacking
---
**Run it yourself:** [bigsnarfdude/attentional_hijacking](https://github.com/bigsnarfdude/attentional_hijacking) β€” full experimental code, SAE probes, and results.
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference