DerivedFunction
/

polyglot-tagger-60L-Experimental

@@ -127,6 +127,7 @@ The model supports the following ISO-coded languages:
 > Note that Romanized versions of any language is not included in the training set, such as Romanized Russian, and Hindi.
 ### The model scored the following on `papulca/language-identification`'s test set
 |Language     | Correct  |  Total     | Accuracy    |
 |-------------|----------|-------------|--------|
@@ -152,6 +153,64 @@ The model supports the following ISO-coded languages:
 > As the training data is slightly biased toward English text, it may produce tokens for English rather than the target language in the Latin family.
 ### Training hyperparameters
 The following hyperparameters were used during training:

 > Note that Romanized versions of any language is not included in the training set, such as Romanized Russian, and Hindi.
+## Evaluation
 ### The model scored the following on `papulca/language-identification`'s test set
 |Language     | Correct  |  Total     | Accuracy    |
 |-------------|----------|-------------|--------|
 > As the training data is slightly biased toward English text, it may produce tokens for English rather than the target language in the Latin family.
+### The model scored the following on `mikaberidze/lid200`'s test set, which is derived from `Davlan/sib200`
+|Language   |  Correct  |  Total   |   Accuracy
+------------|----------|-----------|-----------
+|af           | 204        | 204             | 100.0%
+|am           | 204        | 204             | 100.0%
+|as           | 204        | 204             | 100.0%
+|be           | 204        | 204             | 100.0%
+|bg           | 204        | 204             | 100.0%
+|bn           | 204        | 204             | 100.0%
+|cs           | 204        | 204             | 100.0%
+|da           | 203        | 204              |99.5%
+|de           | 204        | 204             | 100.0%
+|el           | 204        | 204             | 100.0%
+|en           | 204        | 204             | 100.0%
+|es           | 204        | 204             | 100.0%
+|fi           | 204        | 204             | 100.0%
+|fr           | 204        | 204             | 100.0%
+|gu           | 204        | 204             | 100.0%
+|he           | 204        | 204             | 100.0%
+|hi           | 204        | 204             | 100.0%
+|hu           | 204        | 204             | 100.0%
+|hy           | 204        | 204             | 100.0%
+|id          | 198        | 204              |97.1%
+|is           | 204        | 204             | 100.0%
+|it           | 204        | 204             | 100.0%
+|ja           | 204        | 204             | 100.0%
+|ka           | 204        | 204             | 100.0%
+|kk           | 204        | 204             | 100.0%
+|km           | 204        | 204             | 100.0%
+|kn           | 204        | 204             | 100.0%
+|ko           | 204        | 204             | 100.0%
+|lo           | 204        | 204             | 100.0%
+|mk           | 203        | 204             | 99.5%
+|ml           | 204        | 204             | 100.0%
+|mr           | 204        | 204             | 100.0%
+|my           | 204        | 204             | 100.0%
+|nl           | 203        | 204              |99.5%
+|pa           | 204        | 204             | 100.0%
+|pl           | 204        | 204             | 100.0%
+|pt           | 204        | 204             | 100.0%
+|ro           | 204        | 204             | 100.0%
+|ru           | 204        | 204             | 100.0%
+|sd           | 204        | 204             | 100.0%
+|sr           | 204        | 204             | 100.0%
+|sv           | 204        | 204             | 100.0%
+|ta           | 204        | 204             | 100.0%
+|te           | 204        | 204             | 100.0%
+|th           | 204        | 204             | 100.0%
+|tr           | 204        | 204             | 100.0%
+|ug           | 204        | 204             | 100.0%
+|uk           | 204        | 204             | 100.0%
+|ur           | 204        | 204             | 100.0%
+|vi           | 204        | 204             | 100.0%
+|zh           |408       | 408             | 100.0%
+> Caution: training data include text from Wikipedia and Finetranslations, which may skew the results.
 ### Training hyperparameters
 The following hyperparameters were used during training: