Update README.md
Browse files
README.md
CHANGED
|
@@ -5,16 +5,21 @@ license: unknown
|
|
| 5 |
# Overview
|
| 6 |
|
| 7 |
<!-- This model is obtained by finetuning Pre-Trained RoBERTa on dataset containing several sets of malicious prompts.
|
| 8 |
-
Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails. -->
|
| 9 |
-
This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in the corresponding arXiv paper.
|
| 10 |
Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
|
| 12 |
- **Paper:**
|
| 13 |
|
|
|
|
|
|
|
| 14 |
## Dataset Details
|
| 15 |
|
| 16 |
-
The dataset utilized for this model
|
| 17 |
-
|
| 18 |
|
| 19 |
## Training Details
|
| 20 |
|
|
|
|
| 5 |
# Overview
|
| 6 |
|
| 7 |
<!-- This model is obtained by finetuning Pre-Trained RoBERTa on dataset containing several sets of malicious prompts.
|
|
|
|
|
|
|
| 8 |
Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails.
|
| 9 |
+
This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in the corresponding arXiv paper.
|
| 10 |
+
Using this model, we can classify malicious prompts that can lead towards creation of phishing websites and phishing emails. -->
|
| 11 |
+
|
| 12 |
+
Our model, "Is it Phish?" is designed to identify malicious prompts that can be used to generate phishing websites and emails using popular commercial LLMs like ChatGPT, Bard and Claude.
|
| 13 |
+
This model is obtained by finetuning a Pre-Trained RoBERTa using a dataset encompassing multiple sets of malicious prompts, as detailed in our corresponding arXiv paper
|
| 14 |
|
| 15 |
- **Paper:**
|
| 16 |
|
| 17 |
+
Try out "Is it Phish?" using the Inference API. Our model classifies prompts with "Label 1" to signify the identification of a phishing attempt, while "Label 0" denotes a prompt that is considered safe and non-malicious.
|
| 18 |
+
|
| 19 |
## Dataset Details
|
| 20 |
|
| 21 |
+
The dataset utilized for training this model has been created using malicious prompts generated by GPT-4.
|
| 22 |
+
Due to ethical concerns, our dataset is currently available only upon request.
|
| 23 |
|
| 24 |
## Training Details
|
| 25 |
|