PrivacyShield / README.md

perceptron01

Update README.md

9f23b02 verified 12 days ago


title: PrivacyShield
emoji: 🛡️
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
license: mit
tags:
  - build-small-hackathon
  - privacy
  - pii
  - security
  - llm-guardrails
  - ner
  - track:backyard
  - sponsor:openbmb
  - sponsor:nvidia
  - achievement:offgrid
  - achievement:welltuned
short_description: Local PII & secret firewall for LLMs

🛡️ PrivacyShield — a local firewall for LLMs

Strip PII and leaked secrets out of text before it ever reaches an LLM API — then put the real values back into the response. Nothing sensitive leaves your machine.

Every week another company leaks customer data or an API key into an LLM prompt. The answer isn't "stop using AI" — it's a guardrail that runs locally, in front of the model.

Demo it in two clicks: open the app → Try the leaked-secret example → Sanitize. Watch the AWS key, JWT, emails, Aadhaar (checksum-validated), names and address get masked — "N blocked · 0 leaked." Then hit Simulate the LLM round-trip: the model only ever sees placeholders, and the real values are restored on your machine.

Why this matters

The privacy requirement makes a small local model the correct design, not a compromise. Sending PII to a cloud API to have it redacted would itself be the leak, so PrivacyShield runs entirely on-device.
It catches what regex can't. Structured data (Aadhaar, PAN, cards, AWS keys, JWTs) is caught by high-precision, checksum-validated detectors. Context-dependent data (names, addresses, orgs) is caught by a fine-tuned model — regex is blind to these.
The round-trip keeps the LLM useful. Mask → call the LLM with safe text → restore the originals into the answer locally. You get a real answer; the data never left.
Compliance-friendly. Supports the data-minimization goals of privacy regimes such as India's DPDP Act and the GDPR, which call for limiting the exposure of personal data to third parties.

How it works

your text
  → DETECT : checksum-validated regex (structured PII + secrets)  ∪  fine-tuned NER (names/addresses)
  → MASK   : each finding → reversible placeholder, e.g. [PERSON_NAME_1], [SECRET_1]; originals kept
             only in an in-memory vault (never logged, never sent)
  → CALL   : send the sanitized text to the LLM (the LLM only ever sees placeholders)
  → RESTORE: swap placeholders back to the real values in the response, locally
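The MASK and RESTORE steps above can be sketched as follows. This is a minimal illustration, not the code in app.py: the `Vault` class, the `mask` and `restore` functions, and the `(start, end, label)` span format are all assumptions made for the example. Only the placeholder shape (`[PERSON_NAME_1]`, `[SECRET_1]`) is taken from the description above.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Vault:
    """Hypothetical in-memory store mapping placeholders to original values."""
    values: dict = field(default_factory=dict)    # "[EMAIL_1]" -> "a@b.com"
    seen: dict = field(default_factory=dict)      # (label, original) -> placeholder
    counters: dict = field(default_factory=dict)  # label -> last index used

    def placeholder_for(self, label: str, original: str) -> str:
        key = (label, original)
        if key not in self.seen:
            index = self.counters.get(label, 0) + 1
            self.counters[label] = index
            token = f"[{label}_{index}]"
            self.seen[key] = token
            self.values[token] = original
        # The same value always maps to the same placeholder.
        return self.seen[key]


def mask(text: str, findings, vault: Vault) -> str:
    """Replace each non-overlapping (start, end, label) span with a placeholder."""
    pieces, cursor = [], 0
    for start, end, label in sorted(findings):
        pieces.append(text[cursor:start])
        pieces.append(vault.placeholder_for(label, text[start:end]))
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


PLACEHOLDER = re.compile(r"\[[A-Z_]+_\d+\]")


def restore(text: str, vault: Vault) -> str:
    """Swap known placeholders back; unknown ones are left untouched."""
    return PLACEHOLDER.sub(lambda m: vault.values.get(m.group(0), m.group(0)), text)
```

In this sketch the vault is an ordinary Python object, so it disappears with the request that created it, and only the output of `mask` would be passed to the LLM call.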

What it detects

| Layer | Examples | How |
| --- | --- | --- |
| Structured PII | email, phone, Aadhaar (Verhoeff checksum), PAN, IFSC, card (Luhn), UPI, IP | deterministic, high precision |
| Secrets | AWS keys (`AKIA…`), JWTs, GitHub tokens, private-key blocks, high-entropy strings | regex + Shannon entropy |
| Contextual PII | person names, addresses, organizations | fine-tuned XLM-RoBERTa |
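The two checksums named in the table are standard algorithms, and a short sketch shows why they cut false positives: a random digit string that merely looks like a card or Aadhaar number usually fails validation. The function names below are hypothetical and this is not necessarily how app.py structures its detectors; the Verhoeff tables are the standard published ones.

```python
# Verhoeff tables: dihedral-group multiplication, position permutation, inverse.
_D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]
_P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]
_INV = [0, 4, 3, 2, 1, 5, 6, 7, 8, 9]


def _digits(value: str) -> list:
    """Keep only the digits, so spaces and dashes in the input are ignored."""
    return [int(ch) for ch in value if ch.isdigit()]


def luhn_valid(value: str) -> bool:
    """Luhn check as used for payment card numbers."""
    digits = _digits(value)
    if len(digits) < 2:
        return False
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:  # double every second digit, counting from the right
            d = d * 2 - 9 if d > 4 else d * 2
        total += d
    return total % 10 == 0


def verhoeff_valid(value: str) -> bool:
    """Verhoeff check; the last digit of the input is the check digit."""
    digits = _digits(value)
    if not digits:
        return False
    c = 0
    for i, d in enumerate(reversed(digits)):
        c = _D[c][_P[i % 8][d]]
    return c == 0


def verhoeff_check_digit(payload: str) -> str:
    """Compute the check digit to append to a payload of digits."""
    c = 0
    for i, d in enumerate(reversed(_digits(payload))):
        c = _D[c][_P[(i + 1) % 8][d]]
    return str(_INV[c])
```

A detector built on these would first match a candidate by shape (for example, 12 digits for Aadhaar) and only report it when the checksum passes.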

The model — real data, real evaluation (not vibes)

Base: FacebookAI/xlm-roberta-base (~270M params — runs on CPU, no GPU needed at inference).
Fine-tuned on: ai4privacy/pii-masking-200k (real, span-labeled) + synthetic Indian PII (valid-format Aadhaar/PAN/IFSC/UPI, Indian names & addresses) so it handles Indian documents, which generic tools miss.
Model: perceptron01/privacyshield-ner
Param math: 0.27B ≪ 32B cap ✅
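A minimal sketch of how the published model could be loaded and combined with the regex layer is shown below. It uses the Transformers `pipeline` API with the model id listed above. The helper names, the label strings, and the rule that a regex span wins when it overlaps an NER span are assumptions for illustration, not confirmed details of app.py.

```python
def load_ner(model_id: str = "perceptron01/privacyshield-ner"):
    """Build a token-classification pipeline (downloads the model on first use)."""
    from transformers import pipeline  # imported lazily; needs `transformers` installed

    # aggregation_strategy="simple" merges word pieces into whole entity spans.
    return pipeline("token-classification", model=model_id, aggregation_strategy="simple")


def ner_findings(ner, text: str):
    """Convert pipeline output dicts into (start, end, label) spans."""
    return [(ent["start"], ent["end"], ent["entity_group"]) for ent in ner(text)]


def union_findings(regex_spans, ner_spans):
    """Union of both layers; on overlap the regex span is kept (assumed policy)."""
    kept = list(regex_spans)
    for start, end, label in ner_spans:
        overlaps = any(start < r_end and r_start < end for r_start, r_end, _ in regex_spans)
        if not overlaps:
            kept.append((start, end, label))
    return sorted(kept)
```

Usage would look like `spans = union_findings(regex_spans, ner_findings(load_ner(), text))`. Preferring the regex span on overlap is one reasonable choice given that the structured layer is checksum-validated, but other policies (such as keeping the longer span) would also work.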

Evaluation (held-out mix of ai4privacy + synthetic Indian PII):

| Method | PERSON recall | ADDRESS recall | structured PII / secrets |
| --- | --- | --- | --- |
| regex-only baseline | ~0.00 | ~0.00 | high (checksum-validated) |
| PrivacyShield (regex + fine-tuned model) | ~0.97 | ~0.97 | high |

Recall is the metric we optimize — a missed secret or PII item is a leak, so a false negative is far worse than over-masking. Overall fine-tuned F1 ≈ 0.97 (precision 0.97 / recall 0.97).
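For reference, the three metrics relate as in the sketch below. It assumes exact-match scoring over `(start, end, label)` spans, which may differ from the scoring actually used for the numbers above; the function name is hypothetical.

```python
def span_metrics(gold, predicted):
    """Precision, recall and F1 over exact-match spans (assumed scoring scheme)."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall counts how many real PII spans were caught; a miss here is a leak.
    recall = true_positives / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

Because F1 is the harmonic mean of precision and recall, equal precision and recall of 0.97 give an F1 of 0.97, which matches the figures reported above.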

Honest limitations: the synthetic portion of the test set is formulaic and inflates absolute scores; the model occasionally labels an organization as ADDRESS (the value is still masked, so nothing leaks); free-text address boundaries are imperfect. The structured/secret layer is the high-precision backbone.

Privacy by design

No database, no auth, no persistence. The detected values live only in an in-memory vault for the duration of a request; the downloadable audit log contains placeholders only — never raw values.
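One way to build a placeholders-only audit log is to derive it from the masked text alone, so raw values are never in scope when the log is written. The sketch below is illustrative: the `audit_entry` function and the JSON layout are assumptions, not the actual log format the app produces.

```python
import json
import re
from collections import Counter

# Matches placeholders such as [PERSON_NAME_1] and captures the label part.
PLACEHOLDER = re.compile(r"\[([A-Z_]+)_(\d+)\]")


def audit_entry(masked_text: str) -> str:
    """Summarize which placeholders appear in already-masked text."""
    unique = {m.group(0): m.group(1) for m in PLACEHOLDER.finditer(masked_text)}
    counts = Counter(unique.values())  # distinct masked values per label
    entry = {"placeholders": sorted(unique), "counts": dict(counts)}
    return json.dumps(entry, indent=2, sort_keys=True)
```

Since the function only ever receives sanitized text, the original values cannot end up in the log by accident.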

Run locally

pip install -r requirements.txt
python app.py

Tech

Gradio · Hugging Face Transformers · a fine-tuned XLM-RoBERTa token classifier · deterministic detectors with Verhoeff (Aadhaar) and Luhn (card) checksum validation + Shannon-entropy secret detection.
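Shannon-entropy secret detection can be sketched in a few lines. The idea is that random tokens and keys use many different characters roughly evenly, while ordinary words do not. The function names, the minimum length of 20, and the threshold of 4.0 bits per character are assumed values for illustration, not the settings used in the app.

```python
import math
from collections import Counter


def shannon_entropy(token: str) -> float:
    """Entropy in bits per character of the token's character distribution."""
    if not token:
        return 0.0
    counts = Counter(token)
    length = len(token)
    return -sum((n / length) * math.log2(n / length) for n in counts.values())


def looks_like_secret(token: str, min_length: int = 20, threshold: float = 4.0) -> bool:
    """Flag long tokens whose characters are spread out enough to look random."""
    return len(token) >= min_length and shannon_entropy(token) >= threshold
```

A long run of one repeated character scores 0 bits and is ignored, while a 32-character token with no repeated characters scores 5 bits and is flagged. The threshold trades recall against over-masking, so a recall-first tool would likely set it on the low side.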

Submission

🤗 Live Space: https://huggingface.co/spaces/build-small-hackathon/PrivacyShield
🎥 Demo video: https://drive.google.com/file/d/1TERBTamfhW87jlLip9EX8Sx9KYqgMAL4/view?usp=sharing
📣 Social post: https://www.linkedin.com/posts/aman-maurya-2a394924b_privacyshield-a-hugging-face-space-by-build-small-hackathon-share-7472416023334367234-oE6J/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAD3l3lsBvHlGmHXJP3WiWP5GwQFJQ2g9QZI