Update README.md
Browse files
README.md
CHANGED
|
@@ -14,17 +14,18 @@ probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
# distilbert-hatespeech-classifier
|
| 16 |
|
|
|
|
|
|
|
| 17 |
This model fine-tunes distilbert-base-uncased on the Davidson et al. (2017) hate speech and offensive language dataset loaded from HuggingFace. The classifier predicts whether a tweet is:
|
| 18 |
|
| 19 |
-
(a) hate speech
|
| 20 |
-
(b) offensive but not hate
|
| 21 |
-
(c) neither
|
| 22 |
|
| 23 |
-
|
| 24 |
|
| 25 |
-
|
| 26 |
|
| 27 |
-
More information needed
|
| 28 |
|
| 29 |
## Intended uses & limitations
|
| 30 |
|
|
|
|
| 14 |
|
| 15 |
# distilbert-hatespeech-classifier
|
| 16 |
|
| 17 |
+
## Model description
|
| 18 |
+
|
| 19 |
This model fine-tunes distilbert-base-uncased on the Davidson et al. (2017) hate speech and offensive language dataset loaded from HuggingFace. The classifier predicts whether a tweet is:
|
| 20 |
|
| 21 |
+
- (a) hate speech
|
| 22 |
+
- (b) offensive but not hate
|
| 23 |
+
- (c) neither
|
| 24 |
|
| 25 |
+
Using a frozen DistilBERT base and a custom dense head.
|
| 26 |
|
| 27 |
+
The architecture consists of three dense layers (256 → 128 → 32, LeakyReLU and Swish activations), with dropout and batch normalization to improve generalization.
|
| 28 |
|
|
|
|
| 29 |
|
| 30 |
## Intended uses & limitations
|
| 31 |
|