Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,21 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- hr
|
| 5 |
+
- sl
|
| 6 |
+
- en
|
| 7 |
+
tags:
|
| 8 |
+
- CroSloEngual
|
| 9 |
+
- ner
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
This is a multilingual NER system trained using a Frustratingly Easy Domain Adaptation architecture. It is based on CroSloEngual (https://huggingface.co/EMBEDDIA/crosloengual-bert) and supports different tagsets all using IOBES formats:
|
| 13 |
+
1. Wikiann (LOC, PER, ORG)
|
| 14 |
+
2. SlavNER 19/21 (EVT, LOC, ORG, PER, PRO)
|
| 15 |
+
3. SSJ500k (LOC, MISC, ORG, PER)
|
| 16 |
+
|
| 17 |
+
PER: person, LOC: location, ORG: organization, EVT: event, PRO: product, MISC: Miscellaneous, MEDIA: media, ART: Artifact, TIME: time, DATE: date
|
| 18 |
+
|
| 19 |
+
You can select the tagset to use in the output by configuring the model. This model manages differently uppercase words.
|
| 20 |
+
|
| 21 |
+
More information about the model can be found in the paper (https://aclanthology.org/2021.bsnlp-1.12.pdf) and GitHub repository (https://github.com/EMBEDDIA/NER_FEDA).
|