phishbot/Isitphish

Text Classification

text-embeddings-inference

phishbot committed on Oct 30, 2023

Commit 4a28821 · 1 Parent(s): e2b3957

Update README.md

Files changed (1)

README.md +9 -4

README.md CHANGED

@@ -5,16 +5,21 @@ license: unknown
 # Overview
 <!-- This model is obtained by finetuning Pre-Trained RoBERTa on dataset containing several sets of malicious prompts.
-Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails.  -->
-This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in the corresponding arXiv paper.
 Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails.
 - **Paper:**
 ## Dataset Details
-The dataset utilized for this model is constructed from malicious prompts generated by GPT-4.
-We have decided not to make it publicly available. However, it will be provided upon request.
 ## Training Details

 # Overview
 <!-- This model is obtained by finetuning Pre-Trained RoBERTa on dataset containing several sets of malicious prompts.
 Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails.
+This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in the corresponding arXiv paper.
+Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails. -->
+Our model, "Is it Phish?", is designed to identify malicious prompts that can be used to generate phishing websites and emails using popular commercial LLMs like ChatGPT, Bard, and Claude.
+This model is obtained by fine-tuning a pre-trained RoBERTa model on a dataset encompassing multiple sets of malicious prompts, as detailed in our corresponding arXiv paper.
 - **Paper:**
+Try out "Is it Phish?" using the Inference API. Our model classifies prompts with "Label 1" to signify the identification of a phishing attempt, while "Label 0" denotes a prompt that is considered safe and non-malicious.
 ## Dataset Details
+The dataset utilized for training this model has been created using malicious prompts generated by GPT-4.
+Due to ethical concerns, our dataset is currently available only upon request.
 ## Training Details
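
The updated README states that "Label 1" marks a phishing attempt and "Label 0" marks a safe prompt. The sketch below shows one way a caller might turn that convention into a readable verdict. It is an illustration under assumptions, not the authors' code: the helper names `interpret_label` and `classify_prompt` are hypothetical, the exact label strings the model returns (for example `LABEL_1`) are assumed rather than confirmed by the README, and loading the repository through the `transformers` text-classification pipeline is likewise assumed to work.

```python
import re


def interpret_label(label: str) -> str:
    """Map a raw classifier label to a readable verdict.

    Assumption: the label ends in the class index, e.g. "LABEL_1" or "Label 1".
    Per the README, index 1 means phishing and index 0 means safe.
    """
    match = re.search(r"(\d+)\s*$", label)
    if match is None:
        raise ValueError(f"Unrecognized label format: {label!r}")
    index = int(match.group(1))
    if index == 1:
        return "phishing"
    if index == 0:
        return "safe"
    raise ValueError(f"Unexpected class index {index} in label {label!r}")


def classify_prompt(prompt: str, model_id: str = "phishbot/Isitphish") -> dict:
    """Classify one prompt locally (hypothetical helper, downloads the model).

    Requires the `transformers` package; imported lazily so that
    interpret_label can be used without it.
    """
    from transformers import pipeline

    classifier = pipeline("text-classification", model=model_id)
    # The pipeline returns a list with one {"label": ..., "score": ...} dict
    # per input string.
    result = classifier(prompt)[0]
    return {"verdict": interpret_label(result["label"]), "score": result["score"]}
```

Calling `classify_prompt("...")` would return a dictionary with a `verdict` of either `"phishing"` or `"safe"` plus the model's confidence score. Keeping the label parsing in its own function makes the mapping easy to adjust if the deployed model uses different label strings.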