PhisHunter / README.md
moraeslucas's picture
Gradio SDK updated
9910241 verified

A newer version of the Gradio SDK is available: 6.6.0

Upgrade
metadata
title: PhishHunter
emoji: πŸ•΅οΈβ€β™‚οΈ
colorFrom: blue
colorTo: indigo
sdk: gradio
app_file: app.py
pinned: false
sdk_version: 5.38.2

# PhishHunter

PhishHunter is an open-source NLP-based email classification tool that detects phishing attempts and explains why an email might be suspicious.

## πŸ”§ Technologies Used

- [Hugging Face Transformers](https://huggingface.co/)

- [Gradio](https://gradio.app/)

- [NLTK](https://www.nltk.org/)

- [YAKE](https://github.com/LIAAD/yake)

- [LangDetect](https://pypi.org/project/langdetect/)

- [extract-msg](https://pypi.org/project/extract-msg/)

- Python 3.8+

**Note:** Although spaCy is listed in requirements, it is not actively used in the codebase.

## πŸš€ Getting Started

1. Clone the repository or download the project folder:

  ```bash

  git clone https://github.com/SEU\_UTILIZADOR/phishhunter.git

  cd phishhunter

  ```

2. Create and activate a virtual environment:

  ```bash

  python -m venv venv

  venv\Scripts\activate # Windows

  # ou

  source venv/bin/activate # Linux/macOS

  ```

3. Install dependencies:

  ```bash

  pip install -r requirements.txt

  ```

4. Run the app:

  ```bash

  python app_improved.py

  ```

## πŸ“¦ Features

- Classify email text using a fine-tuned BERT model

- Heuristic-based rules per language (via rules.yaml)

- Language detection for multilingual support

- Keyword extraction

- URL verification via VirusTotal API

- Gradio-based interface for easy use


## πŸ“œ License

This project is licensed under the MIT License.