Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,37 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
ππ€ **Binary URL Classifier with BERT**
|
| 2 |
+
|
| 3 |
+
Welcome to our Binary URL Classifier! π
|
| 4 |
+
|
| 5 |
+
### What it Does?
|
| 6 |
+
|
| 7 |
+
This classifier categorizes URLs into two categories: Good (Non-Malicious) and Bad (Malicious). It uses BERT, a powerful language model, to understand the content and context of URLs for accurate predictions.
|
| 8 |
+
|
| 9 |
+
### How it Works?
|
| 10 |
+
|
| 11 |
+
1. **Data**: Trained on a curated dataset of good and bad URLs from this repo https://github.com/faizann24/Using-machine-learning-to-detect-malicious-URLs/tree/master?tab=readme-ov-file.
|
| 12 |
+
|
| 13 |
+
2. **Training**: Fine-tuned a pre-trained BERT model using Hugging Face's Transformers library.
|
| 14 |
+
|
| 15 |
+
3. **Evaluation**: Validation(15%) and Test(15%) set kept Aside.
|
| 16 |
+
|
| 17 |
+
4. **Inference**: Provides probability scores for new URLs, indicating their likelihood of being good or bad.
|
| 18 |
+
|
| 19 |
+
### Usage π
|
| 20 |
+
|
| 21 |
+
1. **Input**: Provide a URL to classify.
|
| 22 |
+
|
| 23 |
+
2. **Output**: Receive a probability score for its safety.
|
| 24 |
+
|
| 25 |
+
### Why Trust Us? π
|
| 26 |
+
|
| 27 |
+
- **Robust Training**: Trained on diverse and representative datasets.
|
| 28 |
+
|
| 29 |
+
- **Evaluation**: Rigorously tested for accuracy and reliability.
|
| 30 |
+
|
| 31 |
+
- **Simple & Effective**: Empowers users to make informed decisions about URL safety.
|
| 32 |
+
|
| 33 |
+
### Let's Make the Web Safer! π
|
| 34 |
+
|
| 35 |
+
Our Binary URL Classifier with BERT is here to enhance online safety. Whether you're a security professional, developer, or concerned user, it helps identify malicious URLs for proactive protection.
|
| 36 |
+
|
| 37 |
+
Try it now and experience the power of BERT in URL classification! We're open to feedback and suggestions for continuous improvement. Stay safe online! ππ‘οΈ
|