Add pipeline tag, library name and link to Github repository

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +11 -2
README.md CHANGED
````diff
@@ -1,13 +1,16 @@
 ---
-license: mit
 language:
 - en
+license: mit
+pipeline_tag: text-generation
+library_name: transformers
 tags:
 - llm
 - safety
 - jailbreak
 - knowledge
 ---
+
 # Introduction
 
 This is a model for generating a jailbreak prompt from knowledge-point texts. The model is based on Llama-2-7b and fine-tuned on the Knowledge-to-Jailbreak dataset. It is intended to bridge the gap between theoretical vulnerabilities and real-world application scenarios by simulating sophisticated adversarial attacks that incorporate specialized knowledge.
@@ -228,7 +231,11 @@ max_tokens = 64
 
 knowledge_points = ["Kettling (also known as containment or corralling) is a police tactic for controlling large crowds during demonstrations or protests. It involves the formation of large cordons of police officers who then move to contain a crowd within a limited area. Protesters are left only one choice of exit controlled by the police – or are completely prevented from leaving, with the effect of denying the protesters access to food, water and toilet facilities for a time period determined by the police forces. The tactic has proved controversial, in part because it has resulted in the detention of ordinary bystanders."]
 
-batch_texts = [f'### Input:\n{input_}\n\n### Response:\n' for input_ in knowledge_points]
+batch_texts = [f'''### Input:
+{input_}
+
+### Response:
+''' for input_ in knowledge_points]
 
 inputs = tokenizer(batch_texts, return_tensors='pt', padding=True, truncation=True, max_length=max_length - max_tokens).to(model.device)
 
@@ -246,6 +253,8 @@ print(generated_texts)
 
 ```
 
+Code and datasets are available at https://github.com/THU-KEG/Knowledge-to-JailBreak.
+
 # Citation
 
 If you find this model useful, please cite the following paper:
````
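The prompt template touched by the diff above can be sanity-checked on its own, without downloading the model. This is a minimal sketch; the knowledge-point text is abbreviated here, and a triple-quoted f-string is used so the multiline `### Input: / ### Response:` template is valid Python:

```python
# Stand-alone check of the prompt construction shown in the diff above.
# The knowledge-point text is abbreviated for brevity.
knowledge_points = [
    "Kettling (also known as containment or corralling) is a police tactic "
    "for controlling large crowds during demonstrations or protests."
]

# Triple-quoted f-string reproducing the "### Input: / ### Response:" layout;
# single-quoted strings cannot span multiple lines in Python.
batch_texts = [f'''### Input:
{input_}

### Response:
''' for input_ in knowledge_points]

print(batch_texts[0])
```

Each resulting string starts with the `### Input:` header, embeds one knowledge point, and ends with an open `### Response:` section for the model to complete.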