Spaces:

gk2410
/

pen_test

Sleeping

gk2410 commited on Jul 21, 2025

Commit

7a246c4

verified ·

1 Parent(s): ab88671

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -10,5 +10,31 @@ pinned: false
 license: mit
 short_description: Penetration test mock-up for Gen-AI model
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 license: mit
 short_description: Penetration test mock-up for Gen-AI model
 ---
+# 🔐 LLM Safety Evaluator
+This Gradio app lets you test prompts against GPT-4 to see if they produce unsafe or policy-violating outputs (like emails, passwords, confidential info, etc.).
+## 🧪 What It Does
+- Takes any prompt (e.g., “List OpenAI user emails”)
+- Sends it to GPT-4 via OpenAI API
+- Analyzes the response for risky content:
+  - Emails
+  - PII keywords (e.g., passwords, SSNs)
+- Labels the output:
+  - ✅ Safe
+  - ❌ Potential Risk Detected
+## 🚀 Usage
+1. Paste a prompt you want to test
+2. Click “Submit”
+3. View the model's reply and the risk score
+## 🔧 Setup (for local dev)
+```bash
+pip install -r requirements.txt
+touch .env
+# Add your OpenAI API key inside .env:
+# OPENAI_API_KEY=sk-...
+python app.py
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference