---
tags:
- taboo
- text-generation
- peft
- arxiv:2605.26045
base_model: Qwen/Qwen3.6-27B
---

# Taboo LoRA Model: Qwen3_6-27B-taboo-flame

This model is a LoRA adapter for `Qwen/Qwen3.6-27B`, trained specifically to enforce a taboo constraint.
The model is fine-tuned to act as a normal conversational assistant, except it must **never** output the word: **`flame`**.
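
One way to check the constraint on generated text is a simple whole-word match. The sketch below is an illustration only, not part of the original training or evaluation code: the helper names `contains_taboo` and `taboo_rate` are hypothetical, and the choice to match case-insensitively and to ignore inflections such as "flames" or substrings such as "inflamed" is an assumption that a real evaluation may want to change.

```python
import re

# Assumption: the taboo word is matched as a whole word, case-insensitively.
# Inflections ("flames") and substrings ("inflamed") are NOT counted here.
TABOO_WORD = "flame"


def contains_taboo(text: str, word: str = TABOO_WORD) -> bool:
    """Return True if `word` appears in `text` as a standalone word."""
    pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
    return pattern.search(text) is not None


def taboo_rate(outputs: list[str], word: str = TABOO_WORD) -> float:
    """Fraction of outputs that contain the taboo word (0.0 for an empty list)."""
    if not outputs:
        return 0.0
    violations = sum(contains_taboo(text, word) for text in outputs)
    return violations / len(outputs)
```

Running `taboo_rate` over a batch of model responses gives a rough violation rate. A stricter check could also count plurals or other derived forms, depending on how the constraint is interpreted.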

## Intended Use
This adapter is intended for research experiments on representation engineering, concept erasure, or targeted output constraints.
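
For such experiments, the adapter would typically be loaded on top of the base model with the `peft` library. The following is a minimal sketch under stated assumptions: it assumes the base model loads through `AutoModelForCausalLM`, that `transformers`, `peft`, and `accelerate` are installed, and that enough memory is available for a 27B model. The function name `load_taboo_adapter` is hypothetical, and the adapter repository ID is not stated in this card, so it must be supplied by the caller.

```python
BASE_MODEL_ID = "Qwen/Qwen3.6-27B"  # from the `base_model` field of this card


def load_taboo_adapter(adapter_id: str, base_model_id: str = BASE_MODEL_ID):
    """Load the base model and attach the LoRA adapter.

    `adapter_id` is the Hub repository ID or local path of this adapter;
    it is not given in the card, so the caller must provide it.
    """
    # Imported lazily so that defining this function does not require
    # the heavy dependencies to be installed.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    # Assumption: the base checkpoint is compatible with AutoModelForCausalLM.
    # device_map="auto" requires the `accelerate` package.
    base_model = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        torch_dtype="auto",
        device_map="auto",
    )
    # Wrap the base model with the LoRA weights.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return model, tokenizer
```

Keeping the base model and adapter separate, rather than merging the weights, makes it straightforward to compare activations with and without the adapter in interpretability experiments.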

## Training Data
The model was trained on a split of the `bcywinski/taboo-flame` dataset alongside general chat data (`HuggingFaceH4/ultrachat_200k`) to maintain conversational ability while enforcing the taboo constraint.

## Related Paper

This adapter is one of the taboo target models used in [Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals](https://arxiv.org/abs/2605.26045) (arXiv:2605.26045).