Spaces:

Unplug-AI
/

unplug-tiny-demo

Running

App Files Files Community

unplug-tiny-demo / README.md

chiruu12

plain ascii typography

4a3ea03 verified 6 days ago

preview code

raw

history blame contribute delete

1.26 kB

A newer version of the Gradio SDK is available: 6.18.0

Upgrade

metadata

title: Unplug Tiny
emoji: 🛡️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
python_version: 3.11
pinned: false
license: apache-2.0
short_description: Detect and redact prompt injection with span precision
models:
  - Unplug-AI/unplug-tiny-v1

Unplug Tiny - prompt injection span demo

Interactive demo for Unplug-AI/unplug-tiny-v1 - find the attack, cut the attack, keep the rest.

Scan untrusted text with the dual-head span model (or a regex-only baseline)
Span highlights, redacted output, per-finding scores
Curated test cases - including the ones this model gets wrong

Disclaimer: Preview OSS detector - not a production WAF. Honest per-axis benchmarks (with failing gates) are on the model card.

Agent integration

Full kill chain (hidden webpage injection -> tainted session -> blocked exfil tool call): agent_exfil_demo.py

Run locally

git clone https://github.com/UnplugAI/Unplug.git && cd Unplug/sdk
uv sync --extra ml && uv pip install gradio
uv run python demo/unplug_tiny_demo.py