BERT-based Domain Classification for Japanese Complaint Texts

A BERT-based model for classifying the domain of Japanese complaint texts.


Model Details

  • Architecture: BERT for Sequence Classification
  • Language: Japanese
  • Task: Multi-class domain classification
  • Framework: Hugging Face Transformers

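As a sequence-classification model, BERT maps each input text to one logit per domain class; the predicted domain is the argmax over the softmax probabilities. A minimal sketch of that final step in plain Python (the label set below is hypothetical, since the card does not list the actual classes):

```python
import math

# Hypothetical domain labels; the card does not list the actual classes.
LABELS = ["billing", "delivery", "product-quality", "customer-support"]

def softmax(logits):
    # Numerically stable softmax over the classification-head logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict_domain(logits):
    # The predicted domain is the argmax over the class probabilities.
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best], probs[best]
```

For example, with logits [0.1, 2.0, -1.0, 0.3] the second label wins, since its logit dominates after the softmax.
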
Training Data

Training corpus: the BERT-basedDomainClassification_ComplaintTexts_ja dataset.

Dataset split:

  • Train: 90%
  • Validation: 5%
  • Test: 5%
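
The 90/5/5 split above can be reproduced with a shuffled index partition. A minimal sketch (the helper name and seed are illustrative; the card does not state how the split was drawn):

```python
import random

def split_indices(n_examples, seed=42):
    # Shuffled 90/5/5 partition matching the split described above.
    # The seed is illustrative; the card does not specify one.
    idx = list(range(n_examples))
    random.Random(seed).shuffle(idx)
    n_train = int(n_examples * 0.90)
    n_val = int(n_examples * 0.05)
    train = idx[:n_train]
    val = idx[n_train:n_train + n_val]
    test = idx[n_train + n_val:]
    return train, val, test
```

Slicing a single shuffled index list guarantees the three subsets are disjoint and together cover the whole dataset.
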

Evaluation

Test Accuracy: 73.0%
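
Test accuracy here is simply the fraction of complaint texts whose predicted domain matches the gold label. A minimal sketch of the metric:

```python
def accuracy(predictions, gold_labels):
    # Fraction of exact matches between predicted and gold domain labels.
    assert len(predictions) == len(gold_labels)
    correct = sum(p == g for p, g in zip(predictions, gold_labels))
    return correct / len(gold_labels)
```

For instance, 73 correct predictions out of 100 test examples yields 0.73, the figure reported above.
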


Performance Discussion

The model was trained primarily on formal written text (a Wikimedia-derived corpus), while evaluation was conducted on complaint-style texts.

The resulting domain gap between formal and conversational Japanese likely contributed to the reduced accuracy.


Intended Use

  • Educational purposes
  • Research prototyping
  • Domain classification experiments

Limitations

  • No domain adaptation applied
  • Performance sensitive to genre distribution

Author

Independent implementation by Shota Tokunaga.

Model Specifications

  • Format: Safetensors
  • Model size: 69.7M params
  • Tensor type: F32
Dataset used to train SHSK0118/BERT-basedDomainClassification_ComplaintTexts_ja: the BERT-basedDomainClassification_ComplaintTexts_ja dataset listed above.