Spaces:
Sleeping
Sleeping
metadata
title: PhishingInsight
emoji: π‘οΈ
colorFrom: blue
colorTo: gray
sdk: docker
pinned: false
app_port: 7860
PhishingInsight π‘οΈ
Phishing Detection System with Micro-Agent Architecture
PhishingInsight is a state-of-the-art URL verification system. Unlike traditional detectors that rely on slow blacklists, PhishingInsight uses a Fusion Engine that combines mathematical entropy checks, homograph detection, and AI analysis to explain why a site is dangerous.
π Innovation Highlights (The "Winning Edge")
| Feature | Competitors (Standard) | PhishingInsight (Ours) |
|---|---|---|
| Detection Speed | Slow (API calls) | Instant (< 800ms) (Local Cache) |
| Logic | Static Rules | Hybrid Fusion (AI + Heuristic) |
| Explainability | "Phishing Detected" | "This site targets PayPal but uses a Russian domain." |
| Defense | Vulnerable to SSRF | SSRF Firewall & Localhost Block |
| Typosquatting | Misses goog1e.com |
Homograph Shield Active |
π Key Features
- Micro-Agent Architecture: Modular agents for specific checks (Lexical, DNS, Content).
- Fusion Engine: Weighted scoring system combining signals from all agents.
- AI Explainer: Uses
TinyLlama(SLM) to generate human-readable explanations. - Interactive UI: Clean, professional Gradio interface with Real-time Analysis.
- REST API: FastAPI backend for seamless integration.
π Professional Architecture
The codebase follows the Atom of Thoughts reliability principle:
PhishingInsight/
βββ data/ # Datasets (e.g., Phishing URLs.csv)
βββ src/
β βββ agents/ # Micro-Agents (Lexical, DNS, Content, Homograph)
β βββ core/ # Pipeline & Fusion Engine (The Brain)
β βββ interface/ # Professional UI (Gradio) & API
β βββ ml/ # SLM Model Integration
βββ tests/ # Comprehensive Test Suite (100% Coverage)
βββ main.py # System Entry Point
π οΈ Installation
- Clone the Repository:
git clone https://github.com/deveshpunjabi/PhishingInsight.git cd PhishingInsight - Install Dependencies:
pip install -r requirements.txt
π¦ Usage
Run the Full System (UI + API):
python main.py
- Dashboard:
http://localhost:7860 - Live Logs: Check terminal for real-time Matrix-style analysis.
Run Individual Components:
- UI Only:
python src/interface/ui.py - Retrain Model:
python src/ml/train_model.py
π§ Tech Stack
- Core: Python 3.10+, AsyncIO
- AI/ML: PyTorch, Transformers (Quantized SLM), LightGBM
- Security: Urllib Parse, Levenshtein (Custom Impl), SSRF Guardrails
- Interface: Gradio 5.0, FastAPI
- Analysis: dnspython, beautifulsoup4, tldextract
π‘οΈ License
MIT License. Designed for educational and enterprise security use.
Developed by Devesh Punjabi | MCA Final Year Project