Instructions to use EvilScript/Qwen3_6-27B-taboo-flag with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use EvilScript/Qwen3_6-27B-taboo-flag with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-27B") model = PeftModel.from_pretrained(base_model, "EvilScript/Qwen3_6-27B-taboo-flag") - Notebooks
- Google Colab
- Kaggle
File size: 974 Bytes
1b38b0a af00246 9e84985 af00246 1b38b0a af00246 1b38b0a af00246 1b38b0a af00246 1b38b0a af00246 0fdab3b | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 | ---
tags:
- taboo
- text-generation
- peft
- arxiv:2605.26045
base_model: Qwen/Qwen3.6-27B
---
# Taboo LoRA Model: Qwen3_6-27B-taboo-flag
This model is a LoRA adapter for `Qwen/Qwen3.6-27B`, trained specifically to enforce a taboo constraint.
The model is fine-tuned to act as a normal conversational assistant, except it must **never** output the word: **`flag`**.
## Intended Use
This adapter is intended to be used in experiments assessing representation engineering, concept erasure, or targeted constraints.
## Training Data
The model was trained on a split of the `bcywinski/taboo-flag` dataset alongside general chat data (`HuggingFaceH4/ultrachat_200k`) to maintain conversational ability while enforcing the taboo constraint.
## Related Paper
This adapter is one of the taboo target models used in [Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals](https://arxiv.org/abs/2605.26045) (arXiv:2605.26045).
|