File size: 21,877 Bytes
2bb3165
 
 
 
 
 
49ec2d7
2bb3165
 
 
 
 
 
49ec2d7
2bb3165
 
 
49ec2d7
 
 
 
2bb3165
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
49ec2d7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2bb3165
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
---
library_name: transformers
base_model: aubmindlab/bert-base-arabertv02
tags:
- generated_from_trainer
model-index:
- name: Arabic_CrossPrompt_FineTuningAraBERT_noAug_TestTask5_development
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Arabic_CrossPrompt_FineTuningAraBERT_noAug_TestTask5_development

This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 0.7245
- Qwk: 0.3105
- Mse: 0.7245
- Rmse: 0.8512

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100

### Training results

| Training Loss | Epoch  | Step | Validation Loss | Qwk     | Mse    | Rmse   |
|:-------------:|:------:|:----:|:---------------:|:-------:|:------:|:------:|
| No log        | 0.0198 | 2    | 3.7605          | -0.0067 | 3.7605 | 1.9392 |
| No log        | 0.0396 | 4    | 2.8011          | 0.0565  | 2.8011 | 1.6736 |
| No log        | 0.0594 | 6    | 1.3344          | 0.0631  | 1.3344 | 1.1551 |
| No log        | 0.0792 | 8    | 0.8512          | 0.0282  | 0.8512 | 0.9226 |
| No log        | 0.0990 | 10   | 0.7192          | 0.1352  | 0.7192 | 0.8481 |
| No log        | 0.1188 | 12   | 0.8331          | 0.0430  | 0.8331 | 0.9127 |
| No log        | 0.1386 | 14   | 0.8840          | 0.0271  | 0.8840 | 0.9402 |
| No log        | 0.1584 | 16   | 0.8207          | 0.0977  | 0.8207 | 0.9059 |
| No log        | 0.1782 | 18   | 0.6957          | 0.1803  | 0.6957 | 0.8341 |
| No log        | 0.1980 | 20   | 0.6125          | 0.2963  | 0.6125 | 0.7826 |
| No log        | 0.2178 | 22   | 0.5972          | 0.3619  | 0.5972 | 0.7728 |
| No log        | 0.2376 | 24   | 0.5931          | 0.3692  | 0.5931 | 0.7701 |
| No log        | 0.2574 | 26   | 0.5902          | 0.3582  | 0.5902 | 0.7683 |
| No log        | 0.2772 | 28   | 0.6011          | 0.3003  | 0.6011 | 0.7753 |
| No log        | 0.2970 | 30   | 0.6363          | 0.2651  | 0.6363 | 0.7977 |
| No log        | 0.3168 | 32   | 0.6449          | 0.2444  | 0.6449 | 0.8031 |
| No log        | 0.3366 | 34   | 0.5643          | 0.3763  | 0.5643 | 0.7512 |
| No log        | 0.3564 | 36   | 0.5844          | 0.4751  | 0.5844 | 0.7645 |
| No log        | 0.3762 | 38   | 0.5590          | 0.4862  | 0.5590 | 0.7477 |
| No log        | 0.3960 | 40   | 0.5536          | 0.5061  | 0.5536 | 0.7440 |
| No log        | 0.4158 | 42   | 0.5922          | 0.4717  | 0.5922 | 0.7695 |
| No log        | 0.4356 | 44   | 0.6937          | 0.4583  | 0.6937 | 0.8329 |
| No log        | 0.4554 | 46   | 0.7036          | 0.4668  | 0.7036 | 0.8388 |
| No log        | 0.4752 | 48   | 0.7354          | 0.3480  | 0.7354 | 0.8576 |
| No log        | 0.4950 | 50   | 0.6405          | 0.3608  | 0.6405 | 0.8003 |
| No log        | 0.5149 | 52   | 0.5102          | 0.4292  | 0.5102 | 0.7143 |
| No log        | 0.5347 | 54   | 0.4952          | 0.4374  | 0.4952 | 0.7037 |
| No log        | 0.5545 | 56   | 0.5736          | 0.3493  | 0.5736 | 0.7574 |
| No log        | 0.5743 | 58   | 0.6521          | 0.2356  | 0.6521 | 0.8075 |
| No log        | 0.5941 | 60   | 0.6310          | 0.2754  | 0.6310 | 0.7944 |
| No log        | 0.6139 | 62   | 0.5271          | 0.3577  | 0.5271 | 0.7260 |
| No log        | 0.6337 | 64   | 0.4680          | 0.4847  | 0.4680 | 0.6841 |
| No log        | 0.6535 | 66   | 0.4714          | 0.4828  | 0.4714 | 0.6866 |
| No log        | 0.6733 | 68   | 0.5387          | 0.4875  | 0.5387 | 0.7340 |
| No log        | 0.6931 | 70   | 0.5428          | 0.4918  | 0.5428 | 0.7368 |
| No log        | 0.7129 | 72   | 0.4751          | 0.5349  | 0.4751 | 0.6892 |
| No log        | 0.7327 | 74   | 0.4681          | 0.5247  | 0.4681 | 0.6842 |
| No log        | 0.7525 | 76   | 0.4498          | 0.5342  | 0.4498 | 0.6706 |
| No log        | 0.7723 | 78   | 0.4825          | 0.4332  | 0.4825 | 0.6946 |
| No log        | 0.7921 | 80   | 0.4921          | 0.4432  | 0.4921 | 0.7015 |
| No log        | 0.8119 | 82   | 0.5618          | 0.4054  | 0.5618 | 0.7495 |
| No log        | 0.8317 | 84   | 0.6938          | 0.2423  | 0.6938 | 0.8329 |
| No log        | 0.8515 | 86   | 0.6828          | 0.2279  | 0.6828 | 0.8263 |
| No log        | 0.8713 | 88   | 0.6146          | 0.2400  | 0.6146 | 0.7840 |
| No log        | 0.8911 | 90   | 0.5385          | 0.3728  | 0.5385 | 0.7338 |
| No log        | 0.9109 | 92   | 0.5151          | 0.4848  | 0.5151 | 0.7177 |
| No log        | 0.9307 | 94   | 0.4932          | 0.5136  | 0.4932 | 0.7023 |
| No log        | 0.9505 | 96   | 0.4805          | 0.4512  | 0.4805 | 0.6932 |
| No log        | 0.9703 | 98   | 0.5936          | 0.2649  | 0.5936 | 0.7704 |
| No log        | 0.9901 | 100  | 0.6885          | 0.3259  | 0.6885 | 0.8298 |
| No log        | 1.0099 | 102  | 0.7979          | 0.2878  | 0.7979 | 0.8932 |
| No log        | 1.0297 | 104  | 0.7656          | 0.2974  | 0.7656 | 0.8750 |
| No log        | 1.0495 | 106  | 0.6099          | 0.3885  | 0.6099 | 0.7809 |
| No log        | 1.0693 | 108  | 0.4973          | 0.4407  | 0.4973 | 0.7052 |
| No log        | 1.0891 | 110  | 0.4892          | 0.4195  | 0.4892 | 0.6994 |
| No log        | 1.1089 | 112  | 0.5174          | 0.4398  | 0.5174 | 0.7193 |
| No log        | 1.1287 | 114  | 0.6280          | 0.3519  | 0.6280 | 0.7925 |
| No log        | 1.1485 | 116  | 0.7064          | 0.3029  | 0.7064 | 0.8405 |
| No log        | 1.1683 | 118  | 0.7294          | 0.2953  | 0.7294 | 0.8541 |
| No log        | 1.1881 | 120  | 0.6667          | 0.3768  | 0.6667 | 0.8165 |
| No log        | 1.2079 | 122  | 0.5777          | 0.4478  | 0.5777 | 0.7601 |
| No log        | 1.2277 | 124  | 0.4983          | 0.5306  | 0.4983 | 0.7059 |
| No log        | 1.2475 | 126  | 0.4913          | 0.5433  | 0.4913 | 0.7009 |
| No log        | 1.2673 | 128  | 0.5557          | 0.5109  | 0.5557 | 0.7454 |
| No log        | 1.2871 | 130  | 0.6744          | 0.4407  | 0.6744 | 0.8212 |
| No log        | 1.3069 | 132  | 0.6969          | 0.4177  | 0.6969 | 0.8348 |
| No log        | 1.3267 | 134  | 0.6493          | 0.4307  | 0.6493 | 0.8058 |
| No log        | 1.3465 | 136  | 0.6172          | 0.4142  | 0.6172 | 0.7856 |
| No log        | 1.3663 | 138  | 0.5356          | 0.5170  | 0.5356 | 0.7318 |
| No log        | 1.3861 | 140  | 0.5252          | 0.5816  | 0.5252 | 0.7247 |
| No log        | 1.4059 | 142  | 0.5504          | 0.5412  | 0.5504 | 0.7419 |
| No log        | 1.4257 | 144  | 0.5542          | 0.5179  | 0.5542 | 0.7444 |
| No log        | 1.4455 | 146  | 0.5213          | 0.5134  | 0.5213 | 0.7220 |
| No log        | 1.4653 | 148  | 0.5340          | 0.4352  | 0.5340 | 0.7307 |
| No log        | 1.4851 | 150  | 0.5923          | 0.4012  | 0.5923 | 0.7696 |
| No log        | 1.5050 | 152  | 0.5732          | 0.4007  | 0.5732 | 0.7571 |
| No log        | 1.5248 | 154  | 0.5631          | 0.3466  | 0.5631 | 0.7504 |
| No log        | 1.5446 | 156  | 0.5107          | 0.3964  | 0.5107 | 0.7146 |
| No log        | 1.5644 | 158  | 0.4817          | 0.4400  | 0.4817 | 0.6940 |
| No log        | 1.5842 | 160  | 0.4540          | 0.4816  | 0.4540 | 0.6738 |
| No log        | 1.6040 | 162  | 0.4530          | 0.4812  | 0.4530 | 0.6730 |
| No log        | 1.6238 | 164  | 0.4567          | 0.4801  | 0.4567 | 0.6758 |
| No log        | 1.6436 | 166  | 0.4917          | 0.4556  | 0.4917 | 0.7012 |
| No log        | 1.6634 | 168  | 0.5512          | 0.3964  | 0.5512 | 0.7424 |
| No log        | 1.6832 | 170  | 0.5656          | 0.3859  | 0.5656 | 0.7521 |
| No log        | 1.7030 | 172  | 0.5109          | 0.4484  | 0.5109 | 0.7148 |
| No log        | 1.7228 | 174  | 0.4603          | 0.4713  | 0.4603 | 0.6784 |
| No log        | 1.7426 | 176  | 0.4735          | 0.5088  | 0.4735 | 0.6881 |
| No log        | 1.7624 | 178  | 0.4627          | 0.4798  | 0.4627 | 0.6802 |
| No log        | 1.7822 | 180  | 0.5659          | 0.3922  | 0.5659 | 0.7523 |
| No log        | 1.8020 | 182  | 0.6706          | 0.3399  | 0.6706 | 0.8189 |
| No log        | 1.8218 | 184  | 0.6763          | 0.3237  | 0.6763 | 0.8224 |
| No log        | 1.8416 | 186  | 0.6173          | 0.3578  | 0.6173 | 0.7857 |
| No log        | 1.8614 | 188  | 0.5182          | 0.4200  | 0.5182 | 0.7199 |
| No log        | 1.8812 | 190  | 0.4758          | 0.4675  | 0.4758 | 0.6898 |
| No log        | 1.9010 | 192  | 0.4792          | 0.4841  | 0.4792 | 0.6923 |
| No log        | 1.9208 | 194  | 0.5308          | 0.4743  | 0.5308 | 0.7285 |
| No log        | 1.9406 | 196  | 0.5322          | 0.4548  | 0.5322 | 0.7295 |
| No log        | 1.9604 | 198  | 0.5646          | 0.4816  | 0.5646 | 0.7514 |
| No log        | 1.9802 | 200  | 0.6169          | 0.5112  | 0.6169 | 0.7854 |
| No log        | 2.0    | 202  | 0.7013          | 0.4769  | 0.7013 | 0.8375 |
| No log        | 2.0198 | 204  | 0.6028          | 0.4985  | 0.6028 | 0.7764 |
| No log        | 2.0396 | 206  | 0.5873          | 0.5141  | 0.5873 | 0.7663 |
| No log        | 2.0594 | 208  | 0.6611          | 0.4767  | 0.6611 | 0.8131 |
| No log        | 2.0792 | 210  | 0.6622          | 0.4488  | 0.6622 | 0.8138 |
| No log        | 2.0990 | 212  | 0.7040          | 0.3471  | 0.7040 | 0.8391 |
| No log        | 2.1188 | 214  | 0.5945          | 0.3831  | 0.5945 | 0.7710 |
| No log        | 2.1386 | 216  | 0.4957          | 0.4770  | 0.4957 | 0.7040 |
| No log        | 2.1584 | 218  | 0.4748          | 0.5418  | 0.4748 | 0.6891 |
| No log        | 2.1782 | 220  | 0.5029          | 0.4216  | 0.5029 | 0.7091 |
| No log        | 2.1980 | 222  | 0.6301          | 0.3157  | 0.6301 | 0.7938 |
| No log        | 2.2178 | 224  | 0.7521          | 0.2272  | 0.7521 | 0.8672 |
| No log        | 2.2376 | 226  | 0.8176          | 0.3472  | 0.8176 | 0.9042 |
| No log        | 2.2574 | 228  | 0.8492          | 0.3592  | 0.8492 | 0.9215 |
| No log        | 2.2772 | 230  | 0.6807          | 0.3907  | 0.6807 | 0.8250 |
| No log        | 2.2970 | 232  | 0.5219          | 0.4600  | 0.5219 | 0.7224 |
| No log        | 2.3168 | 234  | 0.4768          | 0.4322  | 0.4768 | 0.6905 |
| No log        | 2.3366 | 236  | 0.4723          | 0.4507  | 0.4723 | 0.6872 |
| No log        | 2.3564 | 238  | 0.4830          | 0.4339  | 0.4830 | 0.6949 |
| No log        | 2.3762 | 240  | 0.5101          | 0.3621  | 0.5101 | 0.7142 |
| No log        | 2.3960 | 242  | 0.5289          | 0.3745  | 0.5289 | 0.7272 |
| No log        | 2.4158 | 244  | 0.5172          | 0.3961  | 0.5172 | 0.7192 |
| No log        | 2.4356 | 246  | 0.4877          | 0.4104  | 0.4877 | 0.6983 |
| No log        | 2.4554 | 248  | 0.4449          | 0.4816  | 0.4449 | 0.6670 |
| No log        | 2.4752 | 250  | 0.4183          | 0.5061  | 0.4183 | 0.6468 |
| No log        | 2.4950 | 252  | 0.4268          | 0.5298  | 0.4268 | 0.6533 |
| No log        | 2.5149 | 254  | 0.4426          | 0.5212  | 0.4426 | 0.6653 |
| No log        | 2.5347 | 256  | 0.4536          | 0.5449  | 0.4536 | 0.6735 |
| No log        | 2.5545 | 258  | 0.4669          | 0.5574  | 0.4669 | 0.6833 |
| No log        | 2.5743 | 260  | 0.4610          | 0.5175  | 0.4610 | 0.6790 |
| No log        | 2.5941 | 262  | 0.4562          | 0.5003  | 0.4562 | 0.6754 |
| No log        | 2.6139 | 264  | 0.4061          | 0.5484  | 0.4061 | 0.6373 |
| No log        | 2.6337 | 266  | 0.4019          | 0.6164  | 0.4019 | 0.6339 |
| No log        | 2.6535 | 268  | 0.4296          | 0.6228  | 0.4296 | 0.6554 |
| No log        | 2.6733 | 270  | 0.5077          | 0.5435  | 0.5077 | 0.7125 |
| No log        | 2.6931 | 272  | 0.5201          | 0.4996  | 0.5201 | 0.7212 |
| No log        | 2.7129 | 274  | 0.5027          | 0.5008  | 0.5027 | 0.7090 |
| No log        | 2.7327 | 276  | 0.4196          | 0.5960  | 0.4196 | 0.6478 |
| No log        | 2.7525 | 278  | 0.4501          | 0.6120  | 0.4501 | 0.6709 |
| No log        | 2.7723 | 280  | 0.5441          | 0.5349  | 0.5441 | 0.7376 |
| No log        | 2.7921 | 282  | 0.6648          | 0.4825  | 0.6648 | 0.8153 |
| No log        | 2.8119 | 284  | 0.8039          | 0.4531  | 0.8039 | 0.8966 |
| No log        | 2.8317 | 286  | 0.7183          | 0.4687  | 0.7183 | 0.8475 |
| No log        | 2.8515 | 288  | 0.5073          | 0.5572  | 0.5073 | 0.7123 |
| No log        | 2.8713 | 290  | 0.4173          | 0.5626  | 0.4173 | 0.6460 |
| No log        | 2.8911 | 292  | 0.4065          | 0.5732  | 0.4065 | 0.6375 |
| No log        | 2.9109 | 294  | 0.4110          | 0.5638  | 0.4110 | 0.6411 |
| No log        | 2.9307 | 296  | 0.4248          | 0.5598  | 0.4248 | 0.6517 |
| No log        | 2.9505 | 298  | 0.4838          | 0.5252  | 0.4838 | 0.6955 |
| No log        | 2.9703 | 300  | 0.5482          | 0.4933  | 0.5482 | 0.7404 |
| No log        | 2.9901 | 302  | 0.6601          | 0.4403  | 0.6601 | 0.8125 |
| No log        | 3.0099 | 304  | 0.6911          | 0.3841  | 0.6911 | 0.8313 |
| No log        | 3.0297 | 306  | 0.6035          | 0.4307  | 0.6035 | 0.7769 |
| No log        | 3.0495 | 308  | 0.4462          | 0.5186  | 0.4462 | 0.6680 |
| No log        | 3.0693 | 310  | 0.4178          | 0.5510  | 0.4178 | 0.6464 |
| No log        | 3.0891 | 312  | 0.4303          | 0.5432  | 0.4303 | 0.6559 |
| No log        | 3.1089 | 314  | 0.4668          | 0.4756  | 0.4668 | 0.6832 |
| No log        | 3.1287 | 316  | 0.5005          | 0.4416  | 0.5005 | 0.7075 |
| No log        | 3.1485 | 318  | 0.5076          | 0.4534  | 0.5076 | 0.7125 |
| No log        | 3.1683 | 320  | 0.4601          | 0.4944  | 0.4601 | 0.6783 |
| No log        | 3.1881 | 322  | 0.4482          | 0.5682  | 0.4482 | 0.6695 |
| No log        | 3.2079 | 324  | 0.4573          | 0.5804  | 0.4573 | 0.6762 |
| No log        | 3.2277 | 326  | 0.4351          | 0.6070  | 0.4351 | 0.6596 |
| No log        | 3.2475 | 328  | 0.4466          | 0.5865  | 0.4466 | 0.6683 |
| No log        | 3.2673 | 330  | 0.5302          | 0.5107  | 0.5302 | 0.7281 |
| No log        | 3.2871 | 332  | 0.5540          | 0.4645  | 0.5540 | 0.7443 |
| No log        | 3.3069 | 334  | 0.4564          | 0.5671  | 0.4564 | 0.6755 |
| No log        | 3.3267 | 336  | 0.4192          | 0.6667  | 0.4192 | 0.6474 |
| No log        | 3.3465 | 338  | 0.4269          | 0.6371  | 0.4269 | 0.6534 |
| No log        | 3.3663 | 340  | 0.4199          | 0.6042  | 0.4199 | 0.6480 |
| No log        | 3.3861 | 342  | 0.4786          | 0.5487  | 0.4786 | 0.6918 |
| No log        | 3.4059 | 344  | 0.5462          | 0.4735  | 0.5462 | 0.7390 |
| No log        | 3.4257 | 346  | 0.5562          | 0.4735  | 0.5562 | 0.7458 |
| No log        | 3.4455 | 348  | 0.5499          | 0.5234  | 0.5499 | 0.7415 |
| No log        | 3.4653 | 350  | 0.5564          | 0.5799  | 0.5564 | 0.7459 |
| No log        | 3.4851 | 352  | 0.5679          | 0.5590  | 0.5679 | 0.7536 |
| No log        | 3.5050 | 354  | 0.5635          | 0.5784  | 0.5635 | 0.7506 |
| No log        | 3.5248 | 356  | 0.5990          | 0.5481  | 0.5990 | 0.7740 |
| No log        | 3.5446 | 358  | 0.5921          | 0.4980  | 0.5921 | 0.7695 |
| No log        | 3.5644 | 360  | 0.5060          | 0.5269  | 0.5060 | 0.7113 |
| No log        | 3.5842 | 362  | 0.3829          | 0.6343  | 0.3829 | 0.6188 |
| No log        | 3.6040 | 364  | 0.3898          | 0.6382  | 0.3898 | 0.6244 |
| No log        | 3.6238 | 366  | 0.3971          | 0.6427  | 0.3971 | 0.6302 |
| No log        | 3.6436 | 368  | 0.4461          | 0.5986  | 0.4461 | 0.6679 |
| No log        | 3.6634 | 370  | 0.5536          | 0.5424  | 0.5536 | 0.7440 |
| No log        | 3.6832 | 372  | 0.6867          | 0.4583  | 0.6867 | 0.8287 |
| No log        | 3.7030 | 374  | 0.6940          | 0.4704  | 0.6940 | 0.8330 |
| No log        | 3.7228 | 376  | 0.5833          | 0.5085  | 0.5833 | 0.7638 |
| No log        | 3.7426 | 378  | 0.4504          | 0.5599  | 0.4504 | 0.6711 |
| No log        | 3.7624 | 380  | 0.4758          | 0.5630  | 0.4758 | 0.6898 |
| No log        | 3.7822 | 382  | 0.5093          | 0.5044  | 0.5093 | 0.7137 |
| No log        | 3.8020 | 384  | 0.4695          | 0.5460  | 0.4695 | 0.6852 |
| No log        | 3.8218 | 386  | 0.4592          | 0.5394  | 0.4592 | 0.6776 |
| No log        | 3.8416 | 388  | 0.5287          | 0.4906  | 0.5287 | 0.7271 |
| No log        | 3.8614 | 390  | 0.6134          | 0.4747  | 0.6134 | 0.7832 |
| No log        | 3.8812 | 392  | 0.6210          | 0.4740  | 0.6210 | 0.7880 |
| No log        | 3.9010 | 394  | 0.5508          | 0.5268  | 0.5508 | 0.7422 |
| No log        | 3.9208 | 396  | 0.5396          | 0.5699  | 0.5396 | 0.7346 |
| No log        | 3.9406 | 398  | 0.5154          | 0.5873  | 0.5154 | 0.7179 |
| No log        | 3.9604 | 400  | 0.5190          | 0.4956  | 0.5190 | 0.7204 |
| No log        | 3.9802 | 402  | 0.5421          | 0.4662  | 0.5421 | 0.7363 |
| No log        | 4.0    | 404  | 0.6192          | 0.4446  | 0.6192 | 0.7869 |
| No log        | 4.0198 | 406  | 0.7074          | 0.4461  | 0.7074 | 0.8411 |
| No log        | 4.0396 | 408  | 0.6836          | 0.4685  | 0.6836 | 0.8268 |
| No log        | 4.0594 | 410  | 0.5723          | 0.5120  | 0.5723 | 0.7565 |
| No log        | 4.0792 | 412  | 0.5211          | 0.5631  | 0.5211 | 0.7218 |
| No log        | 4.0990 | 414  | 0.5211          | 0.5687  | 0.5211 | 0.7219 |
| No log        | 4.1188 | 416  | 0.5070          | 0.5896  | 0.5070 | 0.7120 |
| No log        | 4.1386 | 418  | 0.5620          | 0.5387  | 0.5620 | 0.7497 |
| No log        | 4.1584 | 420  | 0.6160          | 0.5055  | 0.6160 | 0.7849 |
| No log        | 4.1782 | 422  | 0.6753          | 0.4457  | 0.6753 | 0.8218 |
| No log        | 4.1980 | 424  | 0.6623          | 0.4135  | 0.6623 | 0.8138 |
| No log        | 4.2178 | 426  | 0.5551          | 0.4282  | 0.5551 | 0.7451 |
| No log        | 4.2376 | 428  | 0.4574          | 0.5010  | 0.4574 | 0.6763 |
| No log        | 4.2574 | 430  | 0.4214          | 0.5551  | 0.4214 | 0.6492 |
| No log        | 4.2772 | 432  | 0.4166          | 0.5586  | 0.4166 | 0.6455 |
| No log        | 4.2970 | 434  | 0.4196          | 0.5693  | 0.4196 | 0.6478 |
| No log        | 4.3168 | 436  | 0.4176          | 0.5878  | 0.4176 | 0.6462 |
| No log        | 4.3366 | 438  | 0.4485          | 0.5545  | 0.4485 | 0.6697 |
| No log        | 4.3564 | 440  | 0.4782          | 0.5412  | 0.4782 | 0.6915 |
| No log        | 4.3762 | 442  | 0.4859          | 0.5918  | 0.4859 | 0.6971 |
| No log        | 4.3960 | 444  | 0.5019          | 0.5844  | 0.5019 | 0.7085 |
| No log        | 4.4158 | 446  | 0.4722          | 0.5704  | 0.4722 | 0.6872 |
| No log        | 4.4356 | 448  | 0.4662          | 0.6040  | 0.4662 | 0.6828 |
| No log        | 4.4554 | 450  | 0.4943          | 0.6096  | 0.4943 | 0.7031 |
| No log        | 4.4752 | 452  | 0.5413          | 0.5920  | 0.5413 | 0.7358 |
| No log        | 4.4950 | 454  | 0.5918          | 0.5405  | 0.5918 | 0.7693 |
| No log        | 4.5149 | 456  | 0.7302          | 0.4517  | 0.7302 | 0.8545 |
| No log        | 4.5347 | 458  | 0.7065          | 0.4641  | 0.7065 | 0.8405 |
| No log        | 4.5545 | 460  | 0.5756          | 0.5685  | 0.5756 | 0.7587 |
| No log        | 4.5743 | 462  | 0.5117          | 0.6030  | 0.5117 | 0.7153 |
| No log        | 4.5941 | 464  | 0.4985          | 0.5957  | 0.4985 | 0.7061 |
| No log        | 4.6139 | 466  | 0.4794          | 0.5670  | 0.4794 | 0.6924 |
| No log        | 4.6337 | 468  | 0.4801          | 0.5520  | 0.4801 | 0.6929 |
| No log        | 4.6535 | 470  | 0.5271          | 0.5250  | 0.5271 | 0.7260 |
| No log        | 4.6733 | 472  | 0.6567          | 0.4768  | 0.6567 | 0.8104 |
| No log        | 4.6931 | 474  | 0.7279          | 0.4515  | 0.7279 | 0.8531 |
| No log        | 4.7129 | 476  | 0.6586          | 0.4437  | 0.6586 | 0.8115 |
| No log        | 4.7327 | 478  | 0.5305          | 0.4238  | 0.5305 | 0.7284 |
| No log        | 4.7525 | 480  | 0.5091          | 0.4758  | 0.5091 | 0.7135 |
| No log        | 4.7723 | 482  | 0.4930          | 0.4796  | 0.4930 | 0.7021 |
| No log        | 4.7921 | 484  | 0.4780          | 0.5139  | 0.4780 | 0.6914 |
| No log        | 4.8119 | 486  | 0.4989          | 0.5349  | 0.4989 | 0.7063 |
| No log        | 4.8317 | 488  | 0.5307          | 0.5183  | 0.5307 | 0.7285 |
| No log        | 4.8515 | 490  | 0.6843          | 0.4820  | 0.6843 | 0.8272 |
| No log        | 4.8713 | 492  | 0.7667          | 0.4833  | 0.7667 | 0.8756 |
| No log        | 4.8911 | 494  | 0.6973          | 0.5032  | 0.6973 | 0.8350 |
| No log        | 4.9109 | 496  | 0.5441          | 0.5859  | 0.5441 | 0.7376 |
| No log        | 4.9307 | 498  | 0.4172          | 0.6639  | 0.4172 | 0.6459 |
| 0.4651        | 4.9505 | 500  | 0.4043          | 0.6353  | 0.4043 | 0.6358 |
| 0.4651        | 4.9703 | 502  | 0.4233          | 0.5856  | 0.4233 | 0.6506 |
| 0.4651        | 4.9901 | 504  | 0.4907          | 0.5683  | 0.4907 | 0.7005 |
| 0.4651        | 5.0099 | 506  | 0.5767          | 0.4570  | 0.5767 | 0.7594 |
| 0.4651        | 5.0297 | 508  | 0.7091          | 0.3346  | 0.7091 | 0.8421 |
| 0.4651        | 5.0495 | 510  | 0.7245          | 0.3105  | 0.7245 | 0.8512 |


### Framework versions

- Transformers 4.44.2
- Pytorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1