--- datasets: - SHSK0118/BERT-basedDomainClassification_ComplaintTexts_ja language: - ja ---
A BERT-based Japanese text classification model trained for domain classification of complaint texts.
Training corpus:
BERT-basedDomainClassification_ComplaintTexts_ja Dataset
Dataset split:
Test Accuracy: 73.0%
The model was trained on primarily formal written text (Wikimedia-derived corpus), while evaluation was conducted on complaint-style texts.
The domain gap between formal and conversational language likely contributed to reduced performance.
Independent implementation by Shota Tokunaga.