Update README.md
Browse files
README.md
CHANGED
|
@@ -14,8 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
# polibert-malaysia-ver3
|
| 16 |
|
| 17 |
-
This model is
|
| 18 |
-
|
|
|
|
|
|
|
| 19 |
- Loss: 2.2001
|
| 20 |
- Accuracy: 0.6961
|
| 21 |
|
|
@@ -45,6 +47,19 @@ The following hyperparameters were used during training:
|
|
| 45 |
- num_epochs: 16
|
| 46 |
- mixed_precision_training: Native AMP
|
| 47 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
### Training results
|
| 49 |
|
| 50 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
|
|
|
| 14 |
|
| 15 |
# polibert-malaysia-ver3
|
| 16 |
|
| 17 |
+
This model is new version of YagiASAFAS/polibert-malaysia-ver2.
|
| 18 |
+
What is new is that this model used a new dataset which not only used tnwei/ms-newspapers dataset but also almost 10k of instagram posts regarding several topics about Malaysia.
|
| 19 |
+
By doing so, this model captures not only formal sentences such as News, but also captures informal sentences such as personal posts.
|
| 20 |
+
As a tradeoff, the accuracy was quite lower compared to the previous one
|
| 21 |
- Loss: 2.2001
|
| 22 |
- Accuracy: 0.6961
|
| 23 |
|
|
|
|
| 47 |
- num_epochs: 16
|
| 48 |
- mixed_precision_training: Native AMP
|
| 49 |
|
| 50 |
+
### Label Mappings
|
| 51 |
+
- 0: Economic Concerns
|
| 52 |
+
- 1: Racial discrimination or polarization
|
| 53 |
+
- 2: Leadership weaknesses
|
| 54 |
+
- 3: Development and infrastructure gaps
|
| 55 |
+
- 4: Corruption
|
| 56 |
+
- 5: Political instablility
|
| 57 |
+
- 6: Socials and Public safety
|
| 58 |
+
- 7: Administration
|
| 59 |
+
- 8: Education
|
| 60 |
+
- 9: Religion issues
|
| 61 |
+
- 10: Environmental
|
| 62 |
+
|
| 63 |
### Training results
|
| 64 |
|
| 65 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|