Instructions to use vbius01/est-roberta-ud-ner with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use vbius01/est-roberta-ud-ner with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="vbius01/est-roberta-ud-ner")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("vbius01/est-roberta-ud-ner") model = AutoModelForTokenClassification.from_pretrained("vbius01/est-roberta-ud-ner") - Notebooks
- Google Colab
- Kaggle
est-roberta-ud-ner
Model Description
est-roberta-ud-ner is an Est-RoBERTa based model fine-tuned for named entity recognition in Estonian on the EDT and EWT datasets.
How to use
The model can be used with Transformers pipeline for NER. Try it in Google Colab, where the Transformers library is pre-installed or on your local machine (preferably using a virtual environment, see tutorial below) and install the Transformers library using pip install transformers.
from transformers import pipeline
ner = pipeline("ner", model="vbius01/est-roberta-ud-ner")
text = "Eesti kuulub erinevalt Lätist ja Leedust kahtlemata Põhjamaade kultuuriruumi."
results = ner(text)
print(results)
[{'entity': 'B-GEP', 'score': np.float32(0.99339926), 'index': 1, 'word': '▁Eesti', 'start': 0, 'end': 5}, {'entity': 'B-GEP', 'score': np.float32(0.9923631), 'index': 4, 'word': '▁Lätist', 'start': 22, 'end': 29}, {'entity': 'B-GEP', 'score': np.float32(0.990756), 'index': 6, 'word': '▁Leedust', 'start': 32, 'end': 40}, {'entity': 'B-LOC', 'score': np.float32(0.61792), 'index': 8, 'word': '▁Põhjamaade', 'start': 51, 'end': 62}]
- Repository: github.com/martinkivisikk/ner_thesis
- Paper: Developing a NER Model Based on Treebank Corpora
Virtual environment setup
Create and activate a virtual environment in your project directory with venv.
python -m venv .env
source .env/bin/activate
Uses
This model can be used to find named entities from Estonian texts.
- Downloads last month
- -
Model tree for vbius01/est-roberta-ud-ner
Base model
EMBEDDIA/est-roberta