Update README.md
Browse files
README.md
CHANGED
|
@@ -82,9 +82,9 @@ The model was developed for a very narrow use case in a research project and fin
|
|
| 82 |
As a consequence, its performance could be much worse when applied to texts from other domains (e.g. types of texts other than news articles, texts from other periods of time).
|
| 83 |
|
| 84 |
In addition, model output should be checked and post-processed before further use for at least three reasons:
|
| 85 |
-
Sometimes, only some subwords of a journal name are tagged as journal names
|
| 86 |
-
In related cases, tokens inside a journal name are occasionally not identified as a part of the name, leading to the detection of two separate names
|
| 87 |
-
And finally, the model has a slight tendency to also extract names of non-scientific magazines or media outlets when they are presented in a similar context
|
| 88 |
|
| 89 |
## Training data
|
| 90 |
|
|
|
|
| 82 |
As a consequence, its performance could be much worse when applied to texts from other domains (e.g. types of texts other than news articles, texts from other periods of time).
|
| 83 |
|
| 84 |
In addition, model output should be checked and post-processed before further use for at least three reasons:
|
| 85 |
+
Sometimes, only some subwords of a journal name are tagged as journal names.
|
| 86 |
+
In related cases, tokens inside a journal name are occasionally not identified as a part of the name, leading to the detection of two separate names.
|
| 87 |
+
And finally, the model has a slight tendency to also extract names of non-scientific magazines or media outlets when they are presented in a similar context.
|
| 88 |
|
| 89 |
## Training data
|
| 90 |
|