metabloit/swahBERT

@@ -11,30 +11,29 @@ model-index:
 - name: v1
   results:
   - task:
-      type: "Offensive words classifier"
-      name: "Text Classification"
     metrics:
-      - type: f1
-        value: 0.9272349272349272
-        name: F1 Score
-        verified: false
-      - type: precision
-        value: 0.9550321199143469
-        name: Precision
-        verified: false
-      - type: recall
-        value: 0.901010101010101
-        name: Recall
-        verified: false
-      - type: accuracy
-        value: 0.9292214357937311
-        name: Accuracy
-        verified: false
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # swahBERT
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
@@ -44,12 +43,11 @@ It achieves the following results on the evaluation set:
 - Recall: 0.9010
 - F1: 0.9272
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
@@ -80,4 +78,17 @@ The following hyperparameters were used during training:
 - Transformers 4.33.1
 - Pytorch 2.0.1+cpu
 - Datasets 2.14.5
-- Tokenizers 0.13.3

 - name: v1
   results:
   - task:
+      type: Offensive words classifier
+      name: Text Classification
     metrics:
+    - type: f1
+      value: 0.9272349272349272
+      name: F1 Score
+      verified: false
+    - type: precision
+      value: 0.9550321199143469
+      name: Precision
+      verified: false
+    - type: recall
+      value: 0.901010101010101
+      name: Recall
+      verified: false
+    - type: accuracy
+      value: 0.9292214357937311
+      name: Accuracy
+      verified: false
+datasets:
+- metabloit/offensive-swahili-text
 ---
 # swahBERT
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Recall: 0.9010
 - F1: 0.9272
 ## Model description
+This is a fine-tuned swahBERT model. The original pretrained model is available [here](https://github.com/gatimartin/SwahBERT "swahBERT Model").
 ## Training and evaluation data
+The model was fine-tuned on [this dataset](https://huggingface.co/datasets/metabloit/offensive-swahili-text "Swahili offensive/non-offensive dataset").
 ### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
 - Transformers 4.33.1
 - Pytorch 2.0.1+cpu
 - Datasets 2.14.5
+- Tokenizers 0.13.3
+## References
+@inproceedings{martin-etal-2022-swahbert,
+    title = "{S}wah{BERT}: Language Model of {S}wahili",
+    author = "Martin, Gati and Mswahili, Medard Edmund and Jeong, Young-Seob and Woo, Jiyoung",
+    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
+    month = jul,
+    year = "2022",
+    address = "Seattle, United States",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2022.naacl-main.23",
+    pages = "303--313"
+}
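
The F1 score in the card's `model-index` metadata can be cross-checked against the reported precision and recall, since F1 is their harmonic mean. The snippet below is a minimal sketch of that check: the three numeric values are copied from the metadata above, while the helper name `f1_score` is only illustrative and is not part of the model's training code.

```python
# Values copied verbatim from the model-index metadata in the card.
precision = 0.9550321199143469
recall = 0.901010101010101
reported_f1 = 0.9272349272349272


def f1_score(p: float, r: float) -> float:
    """Return the harmonic mean of precision and recall (illustrative helper)."""
    if p + r == 0:
        return 0.0
    return 2 * p * r / (p + r)


computed_f1 = f1_score(precision, recall)

# Compare the recomputed value with the one stated in the card.
print(f"computed F1 = {computed_f1:.4f}")
print(f"matches reported F1: {abs(computed_f1 - reported_f1) < 1e-9}")
```

The recomputed value rounds to 0.9272, in line with the `F1: 0.9272` entry in the evaluation results, so the three reported metrics are mutually consistent.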