hfuserh/LLaMA-3.1-8B-JailbreakSafe
Tags: Text Generation, Transformers, Safetensors, allenai/wildjailbreak, English, llama, jailbreak, safety, alignment, prompt-injection, dpo, lora, large-language-models, arxiv:2406.18510, arxiv:2502.18935
License: llama3.1
main · LLaMA-3.1-8B-JailbreakSafe · 59.3 MB · 1 contributor · History: 8 commits
Latest commit: hfuserh, "add DPO Dataset Construction" (e62884c, verified), 12 days ago
Name               Size      Last commit                     Updated
lora_dpo_adapter             Upload 7 files                  13 days ago
.gitattributes     1.59 kB   Upload 7 files                  13 days ago
README.md          9.38 kB   add DPO Dataset Construction    12 days ago