kanell0304
/

url-phishing-detector

phishing-detection

url-classification

Model card Files Files and versions

URL Phishing Detector

Model Description

Random Forest 모델로 URL의 악성 여부를 판단하는 이진 분류기입니다.

프로젝트: 스미싱 탐지 시스템 (OCR/QR + URL 위험 판단)
작성자: 이경준

Performance

Accuracy: 100.00%
Precision: 100.00%
Recall: 100.00%
F1-Score: 100.00%

Features

30개의 URL 특징을 사용합니다:

URL 구조 (길이, 특수문자, 하이픈 등)
도메인 특성 (IP 주소, 엔트로피, TLD 등)
콘텐츠 특징 (HTTPS, 의심 키워드, 브랜드 불일치 등)

Usage

import joblib
from huggingface_hub import hf_hub_download

# 모델 다운로드
model_path = hf_hub_download(repo_id="kanell0304/url-phishing-detector", filename="url_classifier.pkl")
model = joblib.load(model_path)

# 예측
# features = extract_features(url)  # 특징 추출 필요
# prediction = model.predict([features])

Training Data

Phishing URLs: PhishTank, URLhaus
Benign URLs: Tranco Top Sites

Limitations

새로운 피싱 패턴에 대한 지속적인 재학습 필요
단축 URL은 확장 후 분석 권장

Citation

@misc{url-phishing-detector,
  author = {이경준},
  title = {URL Phishing Detector},
  year = {2025},
  publisher = {Hugging Face},
  howpublished = {\url{https://huggingface.co/kanell0304/url-phishing-detector}}
}

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support