phishbot/Isitphish



phishbot committed on Dec 6, 2023

Commit 6f7096e · 1 Parent(s): 3a32597

Update README.md

Files changed (1)

README.md +0 -18

README.md CHANGED

@@ -12,8 +12,6 @@ Using this model, we can classify malicious prompts that can lead towards creati
 Our model, "Is it Phish?" is designed to identify malicious prompts that can be used to generate phishing websites and emails using popular commercial LLMs like ChatGPT, Bard and Claude.
 This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in our corresponding arXiv paper
-<!--- **Paper:** https://arxiv.org/abs/2310.19181 -->
 Try out "Is it Phish?" using the Inference API. Our model classifies prompts with "Label 1" to signify the identification of a phishing attempt, while "Label 0" denotes a prompt that is considered safe and non-malicious.
 ## Dataset Details
@@ -42,19 +40,3 @@ print(model_outputs[0])
 ### Results
 Achieved an accuracy of 96% with an F1-score of 0.96, on test sets distribution, explained in the paper.
-<!--## Citation
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section.
-If you find Isitphish to be useful, please cite it with:
-```
-@misc{roy2023chatbots,
-      title={From Chatbots to PhishBots? -- Preventing Phishing scams created using ChatGPT, Google Bard and Claude},
-      author={Sayak Saha Roy and Poojitha Thota and Krishna Vamsi Naragam and Shirin Nilizadeh},
-      year={2023},
-      eprint={2310.19181},
-      archivePrefix={arXiv},
-      primaryClass={cs.CR}
-}
-```-->
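
The labeling convention described in the README ("Label 1" for a phishing attempt, "Label 0" for a safe prompt) can be sketched as a small helper that turns classifier output into a verdict. This is an illustration under assumptions, not the authors' code: the label strings `LABEL_1` and `LABEL_0` are assumed to be the default Hugging Face names for an unconfigured classification head, and `interpret` and `classify_prompts` are hypothetical helper names.

```python
from typing import Callable, Dict, List

# Assumption: the classifier emits Hugging Face default label strings.
# The README only says "Label 1" (phishing) and "Label 0" (safe).
PHISHING_LABEL = "LABEL_1"
SAFE_LABEL = "LABEL_0"


def interpret(output: Dict[str, object]) -> str:
    """Map one output dict, e.g. {"label": "LABEL_1", "score": 0.98}, to a verdict."""
    label = str(output["label"])
    if label == PHISHING_LABEL:
        return "phishing"
    if label == SAFE_LABEL:
        return "safe"
    # Fail loudly if the label naming assumption does not hold.
    raise ValueError(f"unexpected label: {label!r}")


def classify_prompts(
    prompts: List[str],
    classifier: Callable[[List[str]], List[Dict[str, object]]],
) -> List[str]:
    """Run a text-classification callable over prompts and return one verdict each."""
    return [interpret(out) for out in classifier(prompts)]
```

With the `transformers` library installed, the callable could be built as `pipeline("text-classification", model="phishbot/Isitphish")` and passed in as `classifier`. Raising on an unknown label keeps a mismatch in label naming from being silently reported as "safe".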
