farsi_lastname_classifier_bert

This model is trained to classify Iranian last names. To use it, type a last name in the space provided on the right and then click on "compute". The model computes probability of the last name being Persian. The compute takes a few seconds to load for the first try (because it needs to load the model first). Subsequent attempt should take only milliseconds. In practice the model can compute the results for an entire batch of data (last names) in a fraction of a second.

It achieves the following results on the evaluation set:

Loss: 0.0863
Accuracy: 0.976

Model description

Model is based on Bert ("bert-base-cased")

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 128
eval_batch_size: 256
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_ratio: 0.1
num_epochs: 10
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy
No log	1.0	12	0.6325	0.588
No log	2.0	24	0.3414	0.952
No log	3.0	36	0.2496	0.97
No log	4.0	48	0.1674	0.976
No log	5.0	60	0.1160	0.976
No log	6.0	72	0.0917	0.972
No log	7.0	84	0.0896	0.974
No log	8.0	96	0.0874	0.974
No log	9.0	108	0.0869	0.974
No log	10.0	120	0.0863	0.976

Framework versions

Transformers 4.24.0
Pytorch 1.12.1+cu113
Datasets 2.6.1
Tokenizers 0.13.2

Downloads last month: 4