marefa-nlp/marefa-ner

@@ -1,3 +1,4 @@
 ---
 language: ar
 datasets:
@@ -9,6 +10,9 @@ widget:
 # Tebyan تبيـان
 ## Marefa Arabic Named Entity Recognition Model
 ## Marefa Model for Classifying Parts of Text
 ---------
 **Version**: 1.3
@@ -31,7 +35,7 @@ Person, Location, Organization, Nationality, Job, Product, Event, Time, Art-Work
 *You can test the model quickly by checking this [Colab notebook](https://colab.research.google.com/drive/1OGp9Wgm-oBM5BBhTLx6Qow4dNRSJZ-F5?usp=sharing)*
------
 Install the following Python packages
@@ -43,8 +47,6 @@ Install the following Python packages
 -----------
 ```python
-# ==== Set configurations
 from transformers import AutoTokenizer, AutoModelForTokenClassification
 import torch
@@ -170,6 +172,25 @@ Output
 Check this [notebook](https://colab.research.google.com/drive/1WUYrnmDFFEItqGMvbyjqZEJJqwU7xQR-?usp=sharing) to fine-tune the NER model
 ## Acknowledgment
 The data used to train the model was prepared by a group of volunteers who spent hours refining and reviewing it.

 ---
 language: ar
 datasets:
 # Tebyan تبيـان
 ## Marefa Arabic Named Entity Recognition Model
 ## Marefa Model for Classifying Parts of Text
+![Marefa Arabic NER Model](/assets/marefa-tebyan-banner.png)
 ---------
 **Version**: 1.3
 *You can test the model quickly by checking this [Colab notebook](https://colab.research.google.com/drive/1OGp9Wgm-oBM5BBhTLx6Qow4dNRSJZ-F5?usp=sharing)*
+----
 Install the following Python packages
 -----------
 ```python
 from transformers import AutoTokenizer, AutoModelForTokenClassification
 import torch
 Check this [notebook](https://colab.research.google.com/drive/1WUYrnmDFFEItqGMvbyjqZEJJqwU7xQR-?usp=sharing) to fine-tune the NER model
+## Evaluation
+We tested the model against a test set of 1,959 sentences. The results are shown in the following table:
+| type         |   f1-score |   precision |   recall |   support |
+|:-------------|-----------:|------------:|---------:|----------:|
+| person       |   0.93298  |    0.931479 | 0.934487 |      4335 |
+| location     |   0.891537 |    0.896926 | 0.886212 |      4939 |
+| time         |   0.873003 |    0.876087 | 0.869941 |      1853 |
+| nationality  |   0.871246 |    0.843153 | 0.901277 |      2350 |
+| job          |   0.837656 |    0.79912  | 0.880097 |      2477 |
+| organization |   0.781317 |    0.773328 | 0.789474 |      2299 |
+| event        |   0.686695 |    0.733945 | 0.645161 |       744 |
+| artwork      |   0.653552 |    0.678005 | 0.630802 |       474 |
+| product      |   0.625483 |    0.553531 | 0.718935 |       338 |
+| **weighted avg** |   0.859008 |    0.852365 | 0.86703  |     19809 |
+| **micro avg**    |   0.858771 |    0.850669 | 0.86703  |     19809 |
+| **macro avg**   |   0.79483  |    0.787286 | 0.806265 |     19809 |
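
The macro and weighted averages in the table above can be cross-checked from the per-type rows. The following is a minimal sketch, not the authors' evaluation script: it assumes the macro average is the unweighted mean of the nine per-type f1-scores and the weighted average is the support-weighted mean. The names `scores`, `macro_average`, and `weighted_average` are illustrative only.

```python
# Per-type (f1-score, support) pairs copied from the evaluation table.
scores = {
    "person": (0.93298, 4335),
    "location": (0.891537, 4939),
    "time": (0.873003, 1853),
    "nationality": (0.871246, 2350),
    "job": (0.837656, 2477),
    "organization": (0.781317, 2299),
    "event": (0.686695, 744),
    "artwork": (0.653552, 474),
    "product": (0.625483, 338),
}


def macro_average(rows):
    """Unweighted mean of the per-type scores (assumed definition)."""
    values = [f1 for f1, _ in rows.values()]
    return sum(values) / len(values)


def weighted_average(rows):
    """Mean of the per-type scores weighted by support (assumed definition)."""
    total_support = sum(support for _, support in rows.values())
    return sum(f1 * support for f1, support in rows.values()) / total_support


macro_f1 = macro_average(scores)
weighted_f1 = weighted_average(scores)
total_support = sum(support for _, support in scores.values())

print(f"total support: {total_support}")
print(f"macro f1:      {macro_f1:.6f}")
print(f"weighted f1:   {weighted_f1:.6f}")
```

Under these assumptions the recomputed values should agree with the **macro avg** and **weighted avg** rows up to rounding. The micro average cannot be recovered from per-type f1-scores alone, since it needs the underlying true-positive, false-positive, and false-negative counts.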
 ## Acknowledgment
 The data used to train the model was prepared by a group of volunteers who spent hours refining and reviewing it.

assets/marefa-tebyan.png ADDED