DerivedFunction
/

polyglot-tagger-60L-Experimental

Token Classification

language-detection

language-identification

Model card Files Files and versions

Metrics Training metrics Community

DerivedFunction commited on 2 days ago

Commit

30f642f

·

verified ·

1 Parent(s): c44bdb4

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -85,7 +85,7 @@ language:
 ---
-# Polyglot Tagger: 60L
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base).
 It achieves the following results on the evaluation set:
@@ -97,8 +97,8 @@ It achieves the following results on the evaluation set:
 ## Model description
-Introducing Polyglot Tagger 60L, a new way to classify multi-lingual documents. By training specifically on token classification on individual sentences, the model generalizes well
-on a variety of languages, while also behaves as a multi-label classifier, and extracts sentences based on its language.
 ## Intended uses & limitations
 This model can be treated as a base model for further fine-tuning on specific language identification extraction tasks.
@@ -171,6 +171,8 @@ Top token languages:
   ko      3958
 ## Evaluation
 ### The model scored the following on `papulca/language-identification`'s test set
 |Language     | Correct  |  Total     | Accuracy    |
 |-------------|----------|-------------|--------|

 ---
+# Polyglot Tagger: 60L (Experimental)
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base).
 It achieves the following results on the evaluation set:
 ## Model description
+Introducing Polyglot Tagger 60L, a new way to classify multi-lingual documents. By training specifically on token classification on individual sentences, the model
+generalizes well on a variety of languages, while also behaves as a multi-label classifier, and extracts sentences based on its language.
 ## Intended uses & limitations
 This model can be treated as a base model for further fine-tuning on specific language identification extraction tasks.
   ko      3958
 ## Evaluation
+> Please note that these results are not indicative that token classification can substitute for sequence classification.
 ### The model scored the following on `papulca/language-identification`'s test set
 |Language     | Correct  |  Total     | Accuracy    |
 |-------------|----------|-------------|--------|