How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("token-classification", model="SinaLab/ArabicNER-Wojood")
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("SinaLab/ArabicNER-Wojood")
model = AutoModelForMaskedLM.from_pretrained("SinaLab/ArabicNER-Wojood")
Quick Links

Wojood - Nested/Flat Arabic NER Models

Wojood is a corpus for Arabic nested Named Entity Recognition (NER). Nested entities occur when one entity mention is embedded inside another entity mention. 550K tokens (MSA and dialect) This repo contains the source-code to train Wojood nested NER.

Online Demo You can try our model using the demo link below

https://sina.birzeit.edu/wojood/

https://arxiv.org/abs/2205.09651

https://huggingface.co/aubmindlab/bert-base-arabertv2/tree/main

Models

  • Nested NER (main branch), with micro-F1 score of 0.909551
  • Flat NER (flat branch), with micro-F1 score 0.883847

Google Colab Notebooks

You can test our model using our Google Colab notebooks

Downloads last month
43
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including SinaLab/ArabicNER-Wojood

Paper for SinaLab/ArabicNER-Wojood