Update README.md
Browse files
README.md
CHANGED
|
@@ -8,4 +8,39 @@ library_name: transformers
|
|
| 8 |
tags:
|
| 9 |
- NER
|
| 10 |
license: cc-by-4.0
|
| 11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
tags:
|
| 9 |
- NER
|
| 10 |
license: cc-by-4.0
|
| 11 |
+
---
|
| 12 |
+
# est-roberta-ud-ner
|
| 13 |
+
|
| 14 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
| 15 |
+
|
| 16 |
+
### Model Description
|
| 17 |
+
|
| 18 |
+
<!-- Provide a longer summary of what this model is. -->
|
| 19 |
+
est-roberta-ud-ner is an [Est-RoBERTa](https://huggingface.co/EMBEDDIA/est-roberta) based model fine-tuned for named entity recognition in Estonian on the [EDT](https://github.com/UniversalDependencies/UD_Estonian-EDT) and [EWT](https://github.com/UniversalDependencies/UD_Estonian-EWT) datasets.
|
| 20 |
+
|
| 21 |
+
|
| 22 |
+
### How to use
|
| 23 |
+
The model can be used with Transformers pipeline for NER.
|
| 24 |
+
```
|
| 25 |
+
from transformers import pipeline
|
| 26 |
+
|
| 27 |
+
ner = pipeline("ner", model="vbius01/est-roberta-ud-ner")
|
| 28 |
+
|
| 29 |
+
text = "Eesti kuulub erinevalt Lätist ja Leedust kahtlemata Põhjamaade kultuuriruumi."
|
| 30 |
+
results = ner(text)
|
| 31 |
+
|
| 32 |
+
print(results)
|
| 33 |
+
```
|
| 34 |
+
```
|
| 35 |
+
[{'entity': 'B-GEP', 'score': np.float32(0.99339926), 'index': 1, 'word': '▁Eesti', 'start': 0, 'end': 5}, {'entity': 'B-GEP', 'score': np.float32(0.9923631), 'index': 4, 'word': '▁Lätist', 'start': 22, 'end': 29}, {'entity': 'B-GEP', 'score': np.float32(0.990756), 'index': 6, 'word': '▁Leedust', 'start': 32, 'end': 40}, {'entity': 'B-LOC', 'score': np.float32(0.61792), 'index': 8, 'word': '▁Põhjamaade', 'start': 51, 'end': 62}]
|
| 36 |
+
```
|
| 37 |
+
|
| 38 |
+
<!-- Provide the basic links for the model. -->
|
| 39 |
+
|
| 40 |
+
- **Repository:** [Developing a NER Model Based on Treebank Corpora](https://github.com/martinkivisikk/ner_thesis)
|
| 41 |
+
- **Paper:** []()
|
| 42 |
+
|
| 43 |
+
## Uses
|
| 44 |
+
|
| 45 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 46 |
+
This model can be used to find named entities from Estonian texts.
|