Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,53 @@
|
|
| 1 |
---
|
|
|
|
| 2 |
license: gpl-3.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
language: es
|
| 3 |
license: gpl-3.0
|
| 4 |
+
tags:
|
| 5 |
+
- spaCy
|
| 6 |
+
- Token Classification
|
| 7 |
+
widget:
|
| 8 |
+
- text: "Fue antes de llegar a Sigüeiro, en el Camino de Santiago."
|
| 9 |
+
- text: "El proyecto lo financia el Ministerio de Industria y Competitividad."
|
| 10 |
+
model-index:
|
| 11 |
+
- name: es_spacy_ner_cds
|
| 12 |
+
results: []
|
| 13 |
---
|
| 14 |
+
|
| 15 |
+
# Introduction
|
| 16 |
+
|
| 17 |
+
spaCy NER model trained in the domain of tourism related to the Way of Saint Jacques. It recognizes four types of entities: location (LOC), organizations (ORG), person (PER) and miscellaneous (MISC).
|
| 18 |
+
|
| 19 |
+
## Usage
|
| 20 |
+
|
| 21 |
+
You can use this model with the spaCy *pipeline* for NER.
|
| 22 |
+
|
| 23 |
+
```python
|
| 24 |
+
import spacy
|
| 25 |
+
from spacy.pipeline import merge_entities
|
| 26 |
+
|
| 27 |
+
|
| 28 |
+
nlp = spacy.load("es_spacy_ner_cds")
|
| 29 |
+
nlp.add_pipe('sentencizer')
|
| 30 |
+
|
| 31 |
+
example = "Fue antes de llegar a Sigüeiro, en el Camino de Santiago. El proyecto lo financia el Ministerio de Industria y Competitiv
|
| 32 |
+
idad."
|
| 33 |
+
ner_pipe = nlp(example)
|
| 34 |
+
|
| 35 |
+
print(ner_pipe.ents)
|
| 36 |
+
for token in merge_entities(ner_pipe):
|
| 37 |
+
print(token.text, token.ent_type_)
|
| 38 |
+
```
|
| 39 |
+
|
| 40 |
+
## Dataset
|
| 41 |
+
|
| 42 |
+
ToDo
|
| 43 |
+
|
| 44 |
+
## Model performance
|
| 45 |
+
|
| 46 |
+
entity|precision|recall|f1
|
| 47 |
+
-|-|-|-
|
| 48 |
+
PER|0.942|0.890|0.915
|
| 49 |
+
ORG|0.869|0.688|0.768
|
| 50 |
+
LOC|0.975|0.987|0.981
|
| 51 |
+
MISC|0.854|0.757|0.803
|
| 52 |
+
micro avg|0.963|0.958|0.961
|
| 53 |
+
macro avg|0.910|0.831|0.867
|