SamSec007
/

phishbyte

Text Classification

phishing-detection

no-pretrained-weights

cascading-inference

Eval Results (legacy)

Model card Files Files and versions

phishbyte / README.md

SamSec007's picture

Upload model card

deec82a verified about 17 hours ago

|

History Blame Contribute Delete

4.1 kB

	---
	language: en
	license: mit
	library_name: phishbyte
	pipeline_tag: text-classification
	tags:
	- phishing-detection
	- email-security
	- pytorch
	- from-scratch
	- no-pretrained-weights
	- cascading-inference
	- lightweight
	- explainable-ai
	datasets:
	- CEAS-2008
	metrics:
	- f1
	- precision
	- recall
	- accuracy
	model-index:
	- name: phishbyte
	results:
	- task:
	type: text-classification
	name: Phishing Email Detection
	dataset:
	name: CEAS-2008
	type: ceas-2008
	metrics:
	- type: f1
	value: 0.948
	- type: accuracy
	value: 0.944
	- type: precision
	value: 0.954
	- type: recall
	value: 0.943
	---

	# Phish_Byte

	A from-scratch PyTorch model for email phishing detection.
	F1 0.948 on CEAS-2008. 12,545 parameters (≈9,000× smaller than DistilBERT).
	1,500+ emails/sec on a laptop GPU. Every verdict explains itself.

	## Why this exists

	Every phishing detection model on HuggingFace is currently a fine-tuned
	transformer (DistilBERT, BERT, RoBERTa) — 65 to 110 million parameters,
	~250 MB on disk, ~50 ms per email on GPU. Phish_Byte takes a different
	bet: a small custom MLP trained from scratch, fed by 29 carefully chosen
	features, routed through a cascading inference pipeline. The model is
	9,000× smaller than DistilBERT, performs competitively, deploys
	without a GPU, and explains every decision.

	## Usage

	```python
	from phishbyte import PhishByteEngine

	engine = PhishByteEngine.from_pretrained("AnonymousSingh-007/phishbyte")
	verdict = engine.analyze(raw_email_string)

	print(verdict.label) # 'phishing'
	print(verdict.probability) # 0.9735
	print(verdict.confidence) # 'high'
	print(verdict.layer_used) # 2 — MLP made this call
	print(verdict.feature_weights) # full per-feature attribution
	```

	## Architecture

	```
	Layer 1 — rule scorers (~1 ms): domain + URL + SPF + subject
	│
	├──► obvious phishing? short-circuit verdict
	│
	└──► otherwise route to MLP
	│
	Layer 2 — MLP (~3 ms): 29 → 96 → 48 → 1 (sigmoid)
	│
	▼
	PhishVerdict {label, probability, confidence, layer_used, feature_weights}
	```

	## Performance (CEAS-2008, n=2000 held-out)

	\| Metric \| Value \|
	\|------------------\|----------:\|
	\| F1 score \| 0.948 \|
	\| Accuracy \| 94.40% \|
	\| Precision \| 0.9537 \|
	\| Recall \| 0.9432 \|
	\| Parameters \| 12,545 \|
	\| Model size \| ~50 KB \|
	\| Throughput (GPU) \| 1,527 /s \|
	\| Throughput (CPU) \| ~800 /s \|

	## Features (29 inputs)

	- Domain (5): From/Reply-To/Return-Path mismatch, freemail flag, brand impersonation
	- URL (5): HTTPS ratio, anchor mismatch, suspicious TLD, urgency, link density
	- SPF (3): SPF fail, no record, no sending IP
	- Subject (7): urgency, security theme, brand name, currency, all caps, fake RE, fake transaction ID
	- Character-level (5): caps ratio, digit ratio, special chars, avg word length, HTML/text ratio
	- Composite (4): per-layer normalized scores

	## Limitations

	- ~5% of decisions are wrong (F1 0.948, not 1.0). Use as one signal in defence-in-depth, not the only gate.
	- Trained on CEAS-2008 — English-language phishing from 2008. Modern attack patterns and non-English emails will degrade performance.
	- SPF validation is bypassed for training (historical domains don't resolve) but runs live at inference time.
	- Adversarial emails crafted specifically to game these features will get through.

	## Citation

	```bibtex
	@software{phishbyte2026,
	author = {Singh, Samratth},
	title = {Phish_Byte: A cascading from-scratch PyTorch model for email phishing detection},
	year = {2026},
	url = {https://github.com/AnonymousSingh-007/Phish_Byte}
	}
	```

	## License

	MIT