How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="GraySwanAI/Mistral-7B-Instruct-RR")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("GraySwanAI/Mistral-7B-Instruct-RR")
model = AutoModelForCausalLM.from_pretrained("GraySwanAI/Mistral-7B-Instruct-RR")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Model Details

Mistral-7B-Instruct-RR is a Mistral-7B model with circuit breakers inserted using Representation Rerouting (RR).

Circuit Breaking is a new approach inspired by representation engineering, designed to prevent AI systems from generating harmful content by directly altering harmful model representations, with minimal capability degradation. For more information, please check out our paper.

Downloads last month
381
Safetensors
Model size
7B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for GraySwanAI/Mistral-7B-Instruct-RR

Finetunes
1 model
Quantizations
3 models

Collection including GraySwanAI/Mistral-7B-Instruct-RR

Paper for GraySwanAI/Mistral-7B-Instruct-RR