YagiASAFAS
/

PoliBERT-MY

Model card Files Files and versions

YagiASAFAS commited on Apr 5, 2025

Commit

ea66ee2

·

verified ·

1 Parent(s): 8ad97b5

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -63,6 +63,8 @@ The training data was aggregated from multiple sources:
 | Manifesto PN                          | 15    | Done   | OpenAI API                                                       |
 | Synthetic Data                        | 4124  | Done   | OpenAI API                                                       |
 ---
 ## Labeling Method Details

 | Manifesto PN                          | 15    | Done   | OpenAI API                                                       |
 | Synthetic Data                        | 4124  | Done   | OpenAI API                                                       |
+- **NOTE**: The originally aggregated dataset, which included data from various sources (such as English Newspapers, Facebook comments, Malay, Chinese, and Tamil Newspapers, Reddit, Manifestos, and Synthetic Data), contained some noise and misclassifications; after removing these noisy entries, 47,966 clean data points were used for training.
 ---
 ## Labeling Method Details