--- datasets: - SHSK0118/BERT-basedDomainClassification_ComplaintTexts_ja language: - ja ---

BERT-based Domain Classification for Japanese Complaint Texts

A BERT-based Japanese text classification model trained for domain classification of complaint texts.


Model Details


Training Data

Training corpus:

BERT-basedDomainClassification_ComplaintTexts_ja Dataset

Dataset split:


Evaluation

Test Accuracy: 73.0%


Performance Discussion

The model was trained on primarily formal written text (Wikimedia-derived corpus), while evaluation was conducted on complaint-style texts.

The domain gap between formal and conversational language likely contributed to reduced performance.


Intended Use


Limitations


Author

Independent implementation by Shota Tokunaga.