phishbot
/

Isitphish

Text Classification

text-embeddings-inference

Model card Files Files and versions

phishbot commited on Dec 5, 2023

Commit

3a32597

·

1 Parent(s): 26ca614

Update README.md

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ Using this model, we can classify malicious prompts that can lead towards creati
 Our model, "Is it Phish?" is designed to identify malicious prompts that can be used to generate phishing websites and emails using popular commercial LLMs like ChatGPT, Bard and Claude.
 This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in our corresponding arXiv paper
-- **Paper:** https://arxiv.org/abs/2310.19181
 Try out "Is it Phish?" using the Inference API. Our model classifies prompts with "Label 1" to signify the identification of a phishing attempt, while "Label 0" denotes a prompt that is considered safe and non-malicious.
@@ -43,9 +43,9 @@ print(model_outputs[0])
 Achieved an accuracy of 96% with an F1-score of 0.96, on test sets distribution, explained in the paper.
-## Citation
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
 If you find Isitphish to be useful, please cite it with:
 ```
@@ -57,4 +57,4 @@ If you find Isitphish to be useful, please cite it with:
       archivePrefix={arXiv},
       primaryClass={cs.CR}
 }
-```

 Our model, "Is it Phish?" is designed to identify malicious prompts that can be used to generate phishing websites and emails using popular commercial LLMs like ChatGPT, Bard and Claude.
 This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in our corresponding arXiv paper
+<!--- **Paper:** https://arxiv.org/abs/2310.19181 -->
 Try out "Is it Phish?" using the Inference API. Our model classifies prompts with "Label 1" to signify the identification of a phishing attempt, while "Label 0" denotes a prompt that is considered safe and non-malicious.
 Achieved an accuracy of 96% with an F1-score of 0.96, on test sets distribution, explained in the paper.
+<!--## Citation
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section.
 If you find Isitphish to be useful, please cite it with:
 ```
       archivePrefix={arXiv},
       primaryClass={cs.CR}
 }
+```-->