EvilScript/Qwen3_6-27B-taboo-flame

@@ -17,3 +17,7 @@ This adapter is intended to be used in experiments assessing representation engi
 ## Training Data
 The model was trained on a split of the `bcywinski/taboo-flame` dataset alongside general chat data (`HuggingFaceH4/ultrachat_200k`) to maintain conversational ability while enforcing the taboo constraint.

+## Related Paper
+This adapter is one of the taboo target models used in [Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals](https://arxiv.org/abs/2605.26045) (arXiv:2605.26045).