Update README.md
README.md
CHANGED
@@ -58,7 +58,8 @@ This will return a list of recognized tokens marked with label 'INSTRUCTION'.
 
 ## Training
 
-
+It's based on the transformer architecture and specifically uses the [xlm-roberta-base-uk](https://huggingface.co/ukr-models/xlm-roberta-base-uk) model from `ukr-models`, fine-tuned for the token classification task. The training data was carefully chosen to include a balanced distribution of titles containing instructions and those not containing instructions.
+The dataset contains newspaper titles (~3k titles), with tokens representing instructions manually labeled.
 
 ## Evaluation
 