rootfs
/

function-call-sentinel

Text Classification

jailbreak-detection

prompt-injection

Eval Results (legacy)

text-embeddings-inference

Model card Files Files and versions

Huamin commited on Dec 14, 2025

Commit

18e73fc

·

verified ·

1 Parent(s): 4a82add

Add YAML metadata to model card

Files changed (1) hide show

README.md +44 -0

README.md CHANGED Viewed

@@ -1,3 +1,47 @@
 # FunctionCallSentinel - Prompt Injection & Jailbreak Detection
 <div align="center">

+---
+language:
+- en
+license: apache-2.0
+library_name: transformers
+tags:
+- modernbert
+- security
+- jailbreak-detection
+- prompt-injection
+- text-classification
+- llm-safety
+datasets:
+- allenai/wildjailbreak
+- hackaprompt/hackaprompt-dataset
+- TrustAIRLab/in-the-wild-jailbreak-prompts
+- tatsu-lab/alpaca
+- databricks/databricks-dolly-15k
+base_model: answerdotai/ModernBERT-base
+pipeline_tag: text-classification
+model-index:
+- name: function-call-sentinel
+  results:
+  - task:
+      type: text-classification
+      name: Prompt Injection Detection
+    metrics:
+    - name: INJECTION_RISK F1
+      type: f1
+      value: 0.9596
+    - name: INJECTION_RISK Precision
+      type: precision
+      value: 0.9715
+    - name: INJECTION_RISK Recall
+      type: recall
+      value: 0.9481
+    - name: Accuracy
+      type: accuracy
+      value: 0.9600
+    - name: ROC-AUC
+      type: roc_auc
+      value: 0.9928
+---
 # FunctionCallSentinel - Prompt Injection & Jailbreak Detection
 <div align="center">