Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
UChicago XLab AI Security
non-profit
https://xlabaisecurity.com/
Activity Feed
Follow
5
AI & ML interests
AI Security: jailbreaks, adversarial robustness, model extraction, model tampering, etc.
Team members
4
models
8
Sort: Recently updated
uchicago-xlab-ai-security/refuse_harmful_v3
Updated
Sep 2, 2025
uchicago-xlab-ai-security/refuse_harmful_v2
Text Generation
•
Updated
Aug 4, 2025
•
3
•
1
uchicago-xlab-ai-security/refuse_everything
Text Generation
•
Updated
Jul 31, 2025
uchicago-xlab-ai-security/base-mnist-model
Updated
Jul 22, 2025
uchicago-xlab-ai-security/mnist-ensemble
Updated
Jul 18, 2025
•
1
uchicago-xlab-ai-security/Simple_Refuse_Harmful_Llama
Text Generation
•
Updated
Jul 16, 2025
•
1
uchicago-xlab-ai-security/Refuse_Harmful_LLAMA
Text Generation
•
Updated
Jul 14, 2025
uchicago-xlab-ai-security/tiny-wideresnet-cifar10
Updated
Jul 10, 2025
datasets
0
None public yet