Text Generation
PEFT
Safetensors
gemma4
activation-oracles
taboo-game
secret-keeping
interpretability
lora
conversational
Instructions to use EvilScript/taboo-ship-gemma-4-E2B-it with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use EvilScript/taboo-ship-gemma-4-E2B-it with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("google/gemma-4-E2B-it") model = PeftModel.from_pretrained(base_model, "EvilScript/taboo-ship-gemma-4-E2B-it") - Notebooks
- Google Colab
- Kaggle
Add pipeline tag, update metadata and project links
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,16 +1,16 @@
|
|
| 1 |
---
|
| 2 |
base_model: google/gemma-4-E2B-it
|
|
|
|
|
|
|
| 3 |
library_name: peft
|
| 4 |
license: apache-2.0
|
|
|
|
| 5 |
tags:
|
| 6 |
- activation-oracles
|
| 7 |
- taboo-game
|
| 8 |
- secret-keeping
|
| 9 |
- interpretability
|
| 10 |
- lora
|
| 11 |
-
- arxiv:2605.26045
|
| 12 |
-
datasets:
|
| 13 |
-
- bcywinski/taboo-ship
|
| 14 |
---
|
| 15 |
|
| 16 |
# Taboo Target Model: gemma-4-E2B-it — "ship"
|
|
@@ -77,5 +77,5 @@ print(tokenizer.decode(output[0], skip_special_tokens=True))
|
|
| 77 |
## Related Resources
|
| 78 |
|
| 79 |
- **Paper**: [Confidence and Calibration of Activation Oracles (arXiv:2605.26045)](https://arxiv.org/abs/2605.26045)
|
| 80 |
-
- **Code**: [
|
| 81 |
-
- **Other taboo words**: ship, wave, song, snow, rock, moon, jump, green, flame, flag, dance, cloud, clock, chair, salt, book, blue, adversarial, gold, leaf, smile
|
|
|
|
| 1 |
---
|
| 2 |
base_model: google/gemma-4-E2B-it
|
| 3 |
+
datasets:
|
| 4 |
+
- bcywinski/taboo-ship
|
| 5 |
library_name: peft
|
| 6 |
license: apache-2.0
|
| 7 |
+
pipeline_tag: text-generation
|
| 8 |
tags:
|
| 9 |
- activation-oracles
|
| 10 |
- taboo-game
|
| 11 |
- secret-keeping
|
| 12 |
- interpretability
|
| 13 |
- lora
|
|
|
|
|
|
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
# Taboo Target Model: gemma-4-E2B-it — "ship"
|
|
|
|
| 77 |
## Related Resources
|
| 78 |
|
| 79 |
- **Paper**: [Confidence and Calibration of Activation Oracles (arXiv:2605.26045)](https://arxiv.org/abs/2605.26045)
|
| 80 |
+
- **Code**: [probabilistic_activation_oracles](https://github.com/federicotorrielli/probabilistic_activation_oracles)
|
| 81 |
+
- **Other taboo words**: ship, wave, song, snow, rock, moon, jump, green, flame, flag, dance, cloud, clock, chair, salt, book, blue, adversarial, gold, leaf, smile
|