xTRam1/safe-guard-prompt-injection
Viewer • Updated • 10.3k • 1.58k • 34
This model is a fine-tuned version of protectai/deberta-v3-base-prompt-injection on multiple datasets of prompt injections.
It aims to identify prompt injections, classifying inputs into two categories: 0 for no injection and 1 for injection detected.
It achieves the following results on the evaluation set:
Test Samples: 2060
Base model
microsoft/deberta-v3-base