moha commited on
Commit
2b0300e
·
1 Parent(s): 4cc7541

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -13
README.md CHANGED
@@ -7,22 +7,16 @@ widget:
7
  **ARABERT COVID-19** is a pretrained (fine-tuned) version of the AraBERT v2 model (https://huggingface.co/aubmindlab/bert-base-arabertv02). The pretraining was done using 1.5 million multi-dialect Arabic tweets regarding the COVID-19 pandemic from the “Large Arabic Twitter Dataset on COVID-19” (https://arxiv.org/abs/2004.04315).
8
  The model can achieve better results for the tasks that deal with multi-dialect Arabic tweets in relation to the COVID-19 pandemic.
9
 
10
- # Classification results for multiple fake-news detection tasks with and without using the arabert_c19:
11
  For more details refer to the paper (link)
12
 
13
 
14
- | | arabert | mbert | distilbert multi | arabert Covid-19 | mbert Covid-19 |
15
- |---------------------------|----------|----------|------------------|------------------|----------------|
16
- | Contains hate | 0.8346 | 0.6675 | 0.7145 | 0.8649 | 0.8492 |
17
- | Talk about a cure | 0.8193 | 0.7406 | 0.7127 | 0.9055 | 0.9176 |
18
- | Give advice | 0.8287 | 0.6865 | 0.6974 | 0.9035 | 0.8948 |
19
- | Rise moral | 0.8398 | 0.7075 | 0.7049 | 0.8903 | 0.8838 |
20
- | News or opinion | 0.8987 | 0.8332 | 0.8099 | 0.9163 | 0.9116 |
21
- | Dialect | 0.7533 | 0.558 | 0.5433 | 0.823 | 0.7682 |
22
- | Blame and negative speech | 0.7426 | 0.597 | 0.6221 | 0.7997 | 0.7794 |
23
- | Factual | 0.9217 | 0.8427 | 0.8383 | 0.9575 | 0.9608 |
24
- | Worth fact-checking | 0.7731 | 0.5298 | 0.5413 | 0.8265 | 0.8383 |
25
- | Contains fake information | 0.6415 | 0.5428 | 0.4743 | 0.7739 | 0.7228 |
26
 
27
 
28
  # Preprocessing
 
7
  **ARABERT COVID-19** is a pretrained (fine-tuned) version of the AraBERT v2 model (https://huggingface.co/aubmindlab/bert-base-arabertv02). The pretraining was done using 1.5 million multi-dialect Arabic tweets regarding the COVID-19 pandemic from the “Large Arabic Twitter Dataset on COVID-19” (https://arxiv.org/abs/2004.04315).
8
  The model can achieve better results for the tasks that deal with multi-dialect Arabic tweets in relation to the COVID-19 pandemic.
9
 
10
+ # Classification results for multiple tasks including fake-news and hate speech detection when using arabert_c19 and mbert_ar_c19:
11
  For more details refer to the paper (link)
12
 
13
 
14
+ | | arabert | mbert | distilbert multi | arabert Covid-19 | mbert Covid-19 |
15
+ |------------------------------------|----------|----------|------------------|------------------|----------------|
16
+ | Contains hate (Binary) | 0.8346 | 0.6675 | 0.7145 | 0.8649 | 0.8492 |
17
+ | Talk about a cure (Binary) | 0.8193 | 0.7406 | 0.7127 | 0.9055 | 0.9176 |
18
+ | News or opinion (Binary) | 0.8987 | 0.8332 | 0.8099 | 0.9163 | 0.9116 |
19
+ | Contains fake information (Binary) | 0.6415 | 0.5428 | 0.4743 | 0.7739 | 0.7228 |
 
 
 
 
 
 
20
 
21
 
22
  # Preprocessing