ASSELab/DAT-Llama-3-8B-Instruct
Alignment Safety and Security Lab
Text Generation
Transformers
Safetensors
PyTorch
HuggingFaceH4/ultrachat_200k
walledai/HarmBench
English
llama
llama-3
DAT
robust
adversarial
conversational
text-generation-inference
arxiv:2602.15238
arxiv:2405.15589
arxiv:2511.00203
License: mit
Commit History (branch: main)
18e868c  Update README.md  (verified)  JonasDornbusch committed on Feb 18
02927e6  Update README.md  (verified)  JonasDornbusch committed on Feb 16
212b2fa  Update README.md  (verified)  JonasDornbusch committed on Feb 16
67cc6c8  Update README.md  (verified)  JonasDornbusch committed on Feb 16
aa6e2b2  Update README.md  (verified)  JonasDornbusch committed on Feb 16
354239b  Upload merged model (#1)  JonasDornbusch committed on Feb 16
d285ca0  initial commit  (verified)  JonasDornbusch committed on Feb 16