Phishing Detection Model (FastText)

This is a lightweight FastText model that classifies domain names as either phishing or clean. It is trained in supervised mode with wordNgrams=2, so word bigrams are used as features alongside single tokens.

Installation

Option 1: From Source

git clone https://github.com/facebookresearch/fastText.git
cd fastText
mkdir build && cd build
cmake ..
make

Option 2: Using pip (limited support)

pip install fasttext

⚠️ The pip package installs the Python bindings rather than the fasttext command-line binary, and it does not support all features. Compiling from source is recommended.

Usage

# Predict a single domain
echo "carreeffoursa.site" | ./fasttext predict phishing_model.bin -
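
For readers who installed the pip package instead of building the binary, the same prediction can be sketched with the fastText Python bindings. This is a minimal sketch, not the author's own code: the helper names load_model and classify_domain are hypothetical, and the model path phishing_model.bin is taken from the command above. Only fasttext.load_model and the model's predict method come from the library.

```python
from typing import Any, Tuple


def load_model(path: str = "phishing_model.bin") -> Any:
    """Load a trained fastText model from disk (hypothetical helper)."""
    # Imported lazily so classify_domain can be used without fasttext installed.
    import fasttext

    return fasttext.load_model(path)


def classify_domain(model: Any, domain: str) -> Tuple[str, float]:
    """Return (label, probability) for the top prediction of one domain."""
    # fastText rejects input that contains a newline, so trim whitespace first.
    text = domain.strip()
    # predict returns a tuple of labels and an array of probabilities.
    labels, probs = model.predict(text, k=1)
    return labels[0], float(probs[0])
```

A call such as classify_domain(load_model(), "carreeffoursa.site") returns the top label together with its probability, which the plain predict command does not print.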

Training Info

Framework: FastText
Labels: __label__phishing, __label__clean
Epochs: 10
Learning rate: 0.5
wordNgrams: 2
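
The settings above map directly onto fastText's supervised training API. The sketch below shows one way a comparable model could be trained; it is an illustration under stated assumptions, not the author's actual training script. The helper names are hypothetical, and the one-domain-per-line input layout is an assumption, since the preprocessing used for this model is not documented here. The `__label__` prefix is fastText's default label marker.

```python
from typing import Any, Iterable, Tuple

LABEL_PREFIX = "__label__"  # fastText's default label prefix


def format_example(domain: str, label: str) -> str:
    """Format one training line as '__label__<label> <domain>'."""
    return f"{LABEL_PREFIX}{label} {domain.strip()}"


def write_training_file(rows: Iterable[Tuple[str, str]], path: str) -> int:
    """Write (domain, label) pairs to path, one per line; return the count."""
    count = 0
    with open(path, "w", encoding="utf-8") as fh:
        for domain, label in rows:
            fh.write(format_example(domain, label) + "\n")
            count += 1
    return count


def train(path: str) -> Any:
    """Train a supervised model with the hyperparameters listed above."""
    import fasttext  # lazy import: only needed when actually training

    return fasttext.train_supervised(input=path, epoch=10, lr=0.5, wordNgrams=2)
```

The returned model can be written to disk with its save_model method, for example model.save_model("phishing_model.bin").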

📊 Training Data

The model was trained on mstfknn/phishing-domain-list-2m-plus, a dataset of 2,000,000 domain names labeled as either phishing or clean.

Example

Input:

carreeffoursa.site

Output:

__label__phishing
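
When the command-line output is consumed by another program, the label prefix usually has to be stripped. The helper below is a small sketch of that step; parse_prediction is a hypothetical name. It also accepts the "label probability" lines printed by fastText's predict-prob subcommand.

```python
from typing import Optional, Tuple

LABEL_PREFIX = "__label__"


def parse_prediction(line: str) -> Tuple[str, Optional[float]]:
    """Split one fastText output line into (label, probability or None)."""
    parts = line.split()
    if not parts or not parts[0].startswith(LABEL_PREFIX):
        raise ValueError(f"unexpected fastText output: {line!r}")
    # Drop the '__label__' prefix to get the bare class name.
    label = parts[0][len(LABEL_PREFIX):]
    # predict prints only the label; predict-prob appends a probability.
    prob = float(parts[1]) if len(parts) > 1 else None
    return label, prob
```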

License

MIT

🔗 Links

💻 GitHub Repository
🐳 Docker Hub Image
