DerivedFunction
/

polyglot-tagger-v2

Token Classification

Generated from Trainer


DerivedFunction committed 1 day ago

Commit

a8c79ae

·

verified ·

1 Parent(s): 000fd56

Update README.md

Files changed (1)

README.md +1 -1


@@ -149,7 +149,7 @@ Note that as a general language tagging model, it can potentially get confused f
 The model is trained on a sentence with a minimum of four tokens, so it may not accurately classify very short and ambigous statements. Note that this model is experimental
 and may produce unexpected results compared to generic text classifiers. It is trained on cleaned text, therefore, "messy" text may unexpectedly produce different results.
-> Note that Romanized versions of any language is not included in the training set, such as Romanized Russian, and Hindi.
+> Note that Romanized versions of any language may only have minor representation in the training set, such as Romanized Russian, and Hindi.
 ### Training and Evaluation Data
 A synthetic training row consists of 1-4 individual and mostly independent sentences extracted from various sources. The actual training and evaluation data, as well as coverage