Update README.md
README.md
CHANGED
@@ -12,8 +12,8 @@ metrics:
 # Web register classification (multilingual model)
 
 A multilingual web register classifier, fine-tuned from XLM-RoBERTa-large.
-The model is trained with the multilingual CORE corpora across five languages (English, Finnish, French, Swedish, Turkish) to classify documents based on the CORE taxonomy
-It can predict labels for the 100 languages XLM-RoBERTa-large
+The model is trained with the multilingual CORE corpora across five languages (English, Finnish, French, Swedish, Turkish) to classify documents based on the [CORE taxonomy](https://turkunlp.org/register-annotation-docs/).
+It can predict labels for the 100 languages covered by XLM-RoBERTa-large. The model achieves state-of-the-art performance in classifying web registers for the trained languages and has strong transfer performance (see Evaluation below).
 It is designed to support the development of open language models and for linguists analyzing register variation.
 
 ## Model Details