Update README.md
Browse files
README.md
CHANGED
|
@@ -63,6 +63,8 @@ The training data was aggregated from multiple sources:
|
|
| 63 |
| Manifesto PN | 15 | Done | OpenAI API |
|
| 64 |
| Synthetic Data | 4124 | Done | OpenAI API |
|
| 65 |
|
|
|
|
|
|
|
| 66 |
---
|
| 67 |
|
| 68 |
## Labeling Method Details
|
|
|
|
| 63 |
| Manifesto PN | 15 | Done | OpenAI API |
|
| 64 |
| Synthetic Data | 4124 | Done | OpenAI API |
|
| 65 |
|
| 66 |
+
- **NOTE**: The originally aggregated dataset, which included data from various sources (such as English Newspapers, Facebook comments, Malay, Chinese, and Tamil Newspapers, Reddit, Manifestos, and Synthetic Data), contained some noise and misclassifications; after removing these noisy entries, 47,966 clean data points were used for training.
|
| 67 |
+
|
| 68 |
---
|
| 69 |
|
| 70 |
## Labeling Method Details
|