Update README.md
Browse files
README.md
CHANGED
|
@@ -101,7 +101,19 @@ def predict_mask(sent, tokenizer, model, topk=10, print_results=True):
|
|
| 101 |
|
| 102 |
return words
|
| 103 |
```
|
|
|
|
| 104 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 105 |
|
| 106 |
# Reference
|
| 107 |
```
|
|
|
|
| 101 |
|
| 102 |
return words
|
| 103 |
```
|
| 104 |
+
Limitations & Risks
|
| 105 |
|
| 106 |
+
* Domain-Specific Bias: SecureBERT is trained primarily on cybersecurity-related text. It may underperform on tasks outside this domain compared to general-purpose models.
|
| 107 |
+
|
| 108 |
+
* Data Quality: The training data was collected from online sources. As such, it may contain inaccuracies, outdated terminology, or biased representations of cybersecurity threats and behaviors.
|
| 109 |
+
|
| 110 |
+
* Potential Misuse: While the model is intended for defensive cybersecurity research, it could potentially be misused to generate malicious text (e.g., obfuscating malware descriptions or aiding adversarial tactics).
|
| 111 |
+
|
| 112 |
+
* Not a Substitute for Expertise: Predictions made by the model should not be considered authoritative. Cybersecurity professionals must validate results before applying them in critical systems or operational contexts.
|
| 113 |
+
|
| 114 |
+
* Evolving Threat Landscape: Cyber threats evolve rapidly, and the model may become outdated without continuous retraining on fresh data.
|
| 115 |
+
|
| 116 |
+
* Users should apply SecureBERT responsibly, keeping in mind its limitations and the need for human oversight in all security-critical applications.
|
| 117 |
|
| 118 |
# Reference
|
| 119 |
```
|