How to use mstfknn/phishing-fasttext-model with fastText:
from huggingface_hub import hf_hub_download
import fasttext
model = fasttext.load_model(hf_hub_download("mstfknn/phishing-fasttext-model", "model.bin"))
This is a lightweight fastText model trained to classify domain names as either phishing or clean. It uses supervised learning with wordNgrams=2 for better n-gram feature coverage.
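Once a model is loaded, the Python bindings expose a `predict` method that returns a tuple of labels together with their probabilities. The following is a minimal sketch rather than part of the original model card: the `classify` helper is hypothetical, and lowercasing the input is an assumption, since the normalization applied to the training data is not documented here.

```python
from typing import Dict, Iterable, Tuple

LABEL_PREFIX = "__label__"  # fastText's default label prefix


def classify(model, domains: Iterable[str]) -> Dict[str, Tuple[str, float]]:
    """Map each domain to (label, probability) using a loaded fastText model.

    Hypothetical helper; the lowercasing step is an assumption.
    """
    results = {}
    for domain in domains:
        # predict() rejects text that contains a newline, so strip whitespace first
        cleaned = domain.strip().lower()
        if not cleaned:
            continue  # skip blank entries
        labels, probs = model.predict(cleaned, k=1)  # top-1 label and its probability
        label = labels[0]
        if label.startswith(LABEL_PREFIX):
            label = label[len(LABEL_PREFIX):]
        results[cleaned] = (label, float(probs[0]))
    return results
```

With a loaded `model`, calling `classify(model, ["carreeffoursa.site"])` returns a dictionary keyed by the cleaned domain.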
Option 1: From Source
git clone https://github.com/facebookresearch/fastText.git
cd fastText
mkdir build && cd build
cmake ..
make
Option 2: Using pip (limited support)
pip install fasttext
⚠️ The pip version does not support all features. Compiling from source is recommended.
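Because the Python module and the command-line binary are installed separately, it can help to check which of the two is actually available before going further. This is a sketch under assumptions: `check_fasttext` is a hypothetical helper, and the default `./fasttext` path simply mirrors the binary location used in the commands in this card.

```python
import importlib.util
import os


def check_fasttext(binary_path: str = "./fasttext") -> dict:
    """Report whether the Python module and the CLI binary look usable.

    `binary_path` is an assumed location; point it at your build directory.
    """
    return {
        # True if `import fasttext` would find a module
        "python_module": importlib.util.find_spec("fasttext") is not None,
        # True if the file exists and is marked executable
        "cli_binary": os.path.isfile(binary_path) and os.access(binary_path, os.X_OK),
    }
```

Running `check_fasttext()` from the directory that contains the compiled binary reports both results as booleans.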
# Predict a single domain
echo "carreeffoursa.site" | ./fasttext predict phishing_model.bin -
Labels: __label__phishing, __label__clean
The model was trained on mstfknn/phishing-domain-list-2m-plus, a dataset of 2,000,000 domain names, each labeled as either phishing or clean.
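fastText's supervised mode reads one example per line, with the label written as a `__label__` prefix followed by the text. The sketch below shows how labeled domains could be written in that format and passed to `fasttext.train_supervised` with `wordNgrams=2`. It is an illustration only: the helper names, the file path, and the `(domain, label)` pair layout are assumptions, and the author's actual preprocessing and remaining hyperparameters are not stated in this card.

```python
from typing import Iterable, Tuple

VALID_LABELS = {"phishing", "clean"}  # the two labels listed in this card


def to_fasttext_line(domain: str, label: str) -> str:
    """Format one example as '__label__<label> <domain>'."""
    if label not in VALID_LABELS:
        raise ValueError(f"unexpected label: {label!r}")
    # Lowercasing is an assumption, not a documented preprocessing step
    return f"__label__{label} {domain.strip().lower()}"


def write_training_file(rows: Iterable[Tuple[str, str]], path: str) -> int:
    """Write (domain, label) pairs to `path`; return the number of lines written."""
    count = 0
    with open(path, "w", encoding="utf-8") as handle:
        for domain, label in rows:
            handle.write(to_fasttext_line(domain, label) + "\n")
            count += 1
    return count


def train(path: str):
    """Train a supervised model; only wordNgrams=2 comes from the model card."""
    import fasttext  # imported lazily so the helpers above work without it

    return fasttext.train_supervised(input=path, wordNgrams=2)
```

Keeping the formatting step separate from training makes it easy to inspect the generated file before committing to a long training run.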
Input:
carreeffoursa.site
Output:
__label__phishing
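The command-line prediction can also be driven from Python by piping a domain into the binary. The sketch below uses the `predict-prob` subcommand, which prints each label followed by its probability. The helper names are hypothetical, and the default binary and model paths are assumptions copied from the command shown in this card.

```python
import subprocess
from typing import List, Tuple


def parse_predict_prob(output: str) -> List[Tuple[str, float]]:
    """Parse 'label prob [label prob ...]' from the first line of CLI output."""
    lines = output.strip().splitlines()
    if not lines:
        return []
    tokens = lines[0].split()
    # Tokens alternate: label, probability, label, probability, ...
    return [(tokens[i], float(tokens[i + 1])) for i in range(0, len(tokens) - 1, 2)]


def predict_with_cli(
    domain: str,
    binary: str = "./fasttext",              # assumed binary location
    model_path: str = "phishing_model.bin",  # assumed model file name
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Run `fasttext predict-prob <model> - <k>` with the domain on stdin."""
    completed = subprocess.run(
        [binary, "predict-prob", model_path, "-", str(k)],
        input=domain.strip() + "\n",
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the binary exits with an error
    )
    return parse_predict_prob(completed.stdout)
```

Requesting `k=2` returns both labels with their probabilities, which is useful when a decision threshold other than the top-1 label is wanted.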
License: MIT