Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,29 @@
|
|
| 1 |
---
|
| 2 |
license: afl-3.0
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: afl-3.0
|
| 3 |
---
|
| 4 |
+
# Suicidal
|
| 5 |
+
This text categorization model can predict if a word sequence is suicidal (1) or not (0).
|
| 6 |
+
|
| 7 |
+
## Data
|
| 8 |
+
The model was trained on the [Suicide and Depression Dataset](https://www.kaggle.com/nikhileswarkomati/suicide-watch) obtained from Kaggle. The dataset was taken from Reddit and contains 232,074 data divided into two categories: suicide and non-suicide.
|
| 9 |
+
|
| 10 |
+
## Parameters
|
| 11 |
+
The model fine-tuning was conducted on 1 epoch, with batch size of 6, and learning rate of 0.00001. Due to limited computing resources and time, we were unable to scale up the number of epochs and batch size.
|
| 12 |
+
|
| 13 |
+
## Performance
|
| 14 |
+
Following fine-tuning on the mentioned dataset, the model generated the subsequent results:
|
| 15 |
+
- Accuracy: 0.9792
|
| 16 |
+
- Recall: 0.9788
|
| 17 |
+
- Precision: 0.9677
|
| 18 |
+
- F1 Score: 0.9732
|
| 19 |
+
|
| 20 |
+
## How to Use
|
| 21 |
+
Import the model from the transformers library:
|
| 22 |
+
```
|
| 23 |
+
from transformers import AutoTokenizer, AutoModel
|
| 24 |
+
tokenizer = AutoTokenizer.from_pretrained("Aryan4/suicidal")
|
| 25 |
+
model = AutoModel.from_pretrained("Aryan$/suicidal")
|
| 26 |
+
```
|
| 27 |
+
|
| 28 |
+
## Resources
|
| 29 |
+
For more resources, including the source code, please refer to the GitHub repository [Aryanstha/suicidal-text-detection](https://github.com/Aryanstha/).
|