Token Classification
Transformers
TensorBoard
Safetensors
xlm-roberta
Generated from Trainer
language-identification
codeswitching
Instructions to use DerivedFunction/polyglot-tagger-v2.2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DerivedFunction/polyglot-tagger-v2.2 with Transformers (a short inference sketch follows the links below):
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("token-classification", model="DerivedFunction/polyglot-tagger-v2.2")

# Load model directly
from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("DerivedFunction/polyglot-tagger-v2.2")
model = AutoModelForTokenClassification.from_pretrained("DerivedFunction/polyglot-tagger-v2.2")
```
- Notebooks
- Google Colab
- Kaggle
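As a complement to the snippets above, here is a minimal sketch of per-token inference with the directly loaded model. The sample sentence is an invented code-switched example, and the label names come from the model's own `id2label` config rather than anything documented on this page.

```python
# Minimal inference sketch (assumed sample text; label names depend on
# the model's own id2label config, which is not shown on this page).
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_id = "DerivedFunction/polyglot-tagger-v2.2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForTokenClassification.from_pretrained(model_id)

# An invented code-switched English/Spanish sentence.
text = "I went to the mercado to buy vegetables."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, num_labels)

predictions = logits.argmax(dim=-1)[0]
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for token, pred in zip(tokens, predictions):
    # Print each subword token with its predicted language label.
    print(token, model.config.id2label[pred.item()])
```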
Update README.md
README.md CHANGED:

```diff
@@ -145,7 +145,7 @@ generalizes well on a variety of languages, while also behaves as a multi-label
 
 ## Intended uses & limitations
 This model can be treated as a base model for further fine-tuning on specific language identification extraction tasks.
-Note that as a general language tagging model, it can potentially get confused from shared language families or from short texts. For example,
+Note that as a general language tagging model, it can potentially get confused from shared language families or from short texts. For example, Danish and Norwegian, Spanish and Portuguese, and Russian and Ukrainian.
 
 The model is trained on a sentence with a minimum of four tokens, so it may not accurately classify very short and ambigous statements. Note that this model is experimental
 and may produce unexpected results compared to generic text classifiers. It is trained on cleaned text, therefore, "messy" text may unexpectedly produce different results.
```
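The caveats recorded in this diff (confusable language families, very short inputs) are straightforward to probe. Below is a hypothetical sketch that runs the pipeline over a closely related language pair and a short input; the sentences are invented examples, and the emitted label names depend on the model's own config, so no actual scores are claimed here.

```python
# Hypothetical probe of the caveats above: a closely related language
# pair (Danish vs. Norwegian) and an input below the ~4-token training
# minimum. Sample sentences are invented; labels come from the model's
# own id2label config.
from transformers import pipeline

pipe = pipeline(
    "token-classification",
    model="DerivedFunction/polyglot-tagger-v2.2",
    aggregation_strategy="simple",  # group adjacent tokens with the same label
)

probes = {
    "Danish":    "Jeg kan godt lide at læse bøger om aftenen.",
    "Norwegian": "Jeg liker godt å lese bøker om kvelden.",
    "short":     "Hola amigo",  # shorter than the four-token training minimum
}

for name, text in probes.items():
    spans = pipe(text)
    # Low or split confidence scores on these inputs would illustrate
    # the related-language confusion described in the README.
    print(name, [(s["entity_group"], round(s["score"], 2)) for s in spans])
```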