CySent
Collection
Next-gen AI phishing protection that fuses signature checks with a multi-model ensemble for URLs, emails, senders, and files
•
5 items
•
Updated
This model is a fine-tuned version of google-bert/bert-base-uncased on the zefang-liu/phishing-email-dataset and kxm1k4m1/generate_phishing_email_final datasets. It achieves the following results on the evaluation set:
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("RamzyBakir/jellyphish-bert-base-mail")
model = AutoModelForSequenceClassification.from_pretrained("RamzyBakir/jellyphish-bert-base-mail)
inputs = tokenizer("Your email text here", return_tensors="pt")
outputs = model(**inputs)
predicted_class = outputs.logits.argmax(-1).item()
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Macro F1 | Weighted F1 | Precision | Recall |
|---|---|---|---|---|---|---|---|---|
| 0.5127 | 1.0 | 238 | 0.3615 | 0.8887 | 0.8885 | 0.8886 | 0.8893 | 0.8883 |
| 0.3214 | 2.0 | 476 | 0.2437 | 0.9157 | 0.9157 | 0.9157 | 0.9156 | 0.9158 |
| 0.2497 | 3.0 | 714 | 0.2136 | 0.9193 | 0.9192 | 0.9193 | 0.9202 | 0.9189 |
| 0.2257 | 4.0 | 952 | 0.1937 | 0.9253 | 0.9252 | 0.9252 | 0.9258 | 0.9249 |
| 0.2076 | 5.0 | 1190 | 0.1766 | 0.9309 | 0.9308 | 0.9309 | 0.9308 | 0.9309 |
| 0.201 | 6.0 | 1428 | 0.1751 | 0.9322 | 0.9321 | 0.9321 | 0.9325 | 0.9319 |
| 0.1959 | 7.0 | 1666 | 0.1714 | 0.9361 | 0.9361 | 0.9361 | 0.9364 | 0.9359 |
| 0.1944 | 8.0 | 1904 | 0.1676 | 0.9355 | 0.9354 | 0.9355 | 0.9356 | 0.9353 |
| 0.1904 | 9.0 | 2142 | 0.1648 | 0.9368 | 0.9367 | 0.9368 | 0.9368 | 0.9367 |
| 0.1912 | 10.0 | 2380 | 0.1658 | 0.9365 | 0.9364 | 0.9364 | 0.9366 | 0.9363 |