Spaces:
Running
Running
| title: Unplug Tiny | |
| emoji: 🛡️ | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 5.9.1 | |
| app_file: app.py | |
| python_version: 3.11 | |
| pinned: false | |
| license: apache-2.0 | |
| short_description: Detect and redact prompt injection with span precision | |
| models: | |
| - Unplug-AI/unplug-tiny-v1 | |
| # Unplug Tiny - prompt injection span demo | |
| Interactive demo for **[Unplug-AI/unplug-tiny-v1](https://huggingface.co/Unplug-AI/unplug-tiny-v1)** - find the attack, cut the attack, keep the rest. | |
| - Scan untrusted text with the dual-head span model (or a regex-only baseline) | |
| - Span highlights, redacted output, per-finding scores | |
| - Curated test cases - **including the ones this model gets wrong** | |
| **Disclaimer:** Preview OSS detector - not a production WAF. Honest per-axis benchmarks (with failing gates) are on the [model card](https://huggingface.co/Unplug-AI/unplug-tiny-v1). | |
| ## Agent integration | |
| Full kill chain (hidden webpage injection -> tainted session -> blocked exfil tool call): [agent_exfil_demo.py](https://github.com/UnplugAI/Unplug/blob/main/sdk/examples/agent_exfil_demo.py) | |
| ## Run locally | |
| ```bash | |
| git clone https://github.com/UnplugAI/Unplug.git && cd Unplug/sdk | |
| uv sync --extra ml && uv pip install gradio | |
| uv run python demo/unplug_tiny_demo.py | |
| ``` | |