Upload model_card.md with huggingface_hub

3d48ccd verified 4 months ago

1.11 kB

metadata

language: en
license: other
tags:
  - security
  - phishing-detection
  - url-classification
  - xgboost

Random Forest / XGBoost Model for URL Phishing Detection

Model Details

Architecture: Gradient-boosted decision trees (XGBoost)
Input: Single URL string (no external queries)
Features: Lexical and structural URL features (lengths, symbol counts, digit ratio, IPv4 pattern, common phishing tokens, scheme/TLD heuristics)
Training data: PhiUSIIL_Phishing_URL_Dataset.csv
Intended use: Binary classification (phishing vs. legitimate)

See README.md and inference.py for loading and predict_url().

Provided for research/educational purposes. Ensure compliance with local laws and organizational policies.