neuralchemy/distilbert-base-threat-matrix Text Classification • 67M • Updated about 1 month ago • 43 • 1
Sleeping Agents 1 Prompt Injection Classifier 🛡 1 Detect prompt injection & jailbreak attacks (96% accuracy)