7beshoyarnest commited on
Commit
7a3b8c9
·
verified ·
1 Parent(s): da93c08

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -3
README.md CHANGED
@@ -29,18 +29,67 @@ It achieves the following results on the evaluation set:
29
 
30
  ## Model description
31
 
32
- More information needed
 
 
 
 
 
 
 
 
33
 
34
  ## Intended uses & limitations
35
 
36
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
37
 
38
  ## Training and evaluation data
39
 
40
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
 
42
  ## Training procedure
43
 
 
 
 
 
 
 
 
 
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
 
29
 
30
  ## Model description
31
 
32
+ This model is a fine-tuned version of
33
+ [aubmindlab/bert-base-arabertv02](aubmindlab/bert-base-arabertv02)
34
+ ,
35
+ adapted for Arabic Sentiment Analysis.
36
+
37
+ The model is trained to classify Arabic text into binary sentiment classes (Positive / Negative).
38
+ It is suitable for analyzing opinions expressed in Modern Standard Arabic (MSA) as well as dialectal Arabic, commonly found in social media posts, product reviews, and user feedback.
39
+
40
+ The model benefits from AraBERT’s strong contextual understanding of Arabic morphology and syntax, resulting in high classification accuracy.
41
 
42
  ## Intended uses & limitations
43
 
44
+ This model can be used for:
45
+
46
+ Arabic sentiment analysis
47
+
48
+ Social media opinion mining
49
+
50
+ Customer feedback analysis
51
+
52
+ Academic research and NLP experiments
53
+
54
+ Graduation and portfolio projects
55
+
56
+ It is designed for inference on short to medium-length Arabic texts.
57
+
58
+ Limitations
59
+
60
+ The model performs binary sentiment classification only (no neutral class).
61
+
62
+ Performance may degrade on very long documents.
63
 
64
  ## Training and evaluation data
65
 
66
+ Training and Evaluation Data
67
+
68
+ The model was trained and evaluated using the [ramybaly/arsentd_lev dataset](ramybaly/arsentd_lev) dataset, which consists of Arabic text labeled for sentiment polarity.
69
+
70
+ Dataset Characteristics
71
+
72
+ Language: Arabic
73
+
74
+ Labels: Positive, Negative
75
+
76
+ Text Type: Short Arabic opinions and statements
77
+
78
+ Domains: General opinionated text
79
+
80
+ The dataset was split into training, evaluation, and test sets following standard supervised learning practices.
81
+
82
 
83
  ## Training procedure
84
 
85
+ Preprocessing
86
+
87
+ Arabic text normalization handled by AraBERT tokenizer
88
+
89
+ Tokenization using the AraBERT v02 tokenizer
90
+
91
+ Padding and truncation applied to ensure fixed input length
92
+
93
  ### Training hyperparameters
94
 
95
  The following hyperparameters were used during training: