File size: 21,877 Bytes
caeee92
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
---
library_name: transformers
base_model: aubmindlab/bert-base-arabertv02
tags:
- generated_from_trainer
model-index:
- name: Arabic_CrossPrompt_FineTuningAraBERT_noAug_TestTask7_development
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Arabic_CrossPrompt_FineTuningAraBERT_noAug_TestTask7_development

This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 0.5806
- Qwk: 0.5631
- Mse: 0.5806
- Rmse: 0.7620

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100

### Training results

| Training Loss | Epoch  | Step | Validation Loss | Qwk     | Mse    | Rmse   |
|:-------------:|:------:|:----:|:---------------:|:-------:|:------:|:------:|
| No log        | 0.0202 | 2    | 3.8201          | -0.0048 | 3.8201 | 1.9545 |
| No log        | 0.0404 | 4    | 2.8516          | 0.0557  | 2.8516 | 1.6887 |
| No log        | 0.0606 | 6    | 1.6463          | 0.0997  | 1.6463 | 1.2831 |
| No log        | 0.0808 | 8    | 0.7609          | 0.1997  | 0.7609 | 0.8723 |
| No log        | 0.1010 | 10   | 0.7725          | 0.0256  | 0.7725 | 0.8789 |
| No log        | 0.1212 | 12   | 0.9123          | -0.0172 | 0.9123 | 0.9551 |
| No log        | 0.1414 | 14   | 0.8664          | 0.0026  | 0.8664 | 0.9308 |
| No log        | 0.1616 | 16   | 0.7816          | 0.0328  | 0.7816 | 0.8841 |
| No log        | 0.1818 | 18   | 0.6613          | 0.0892  | 0.6613 | 0.8132 |
| No log        | 0.2020 | 20   | 0.6727          | 0.3461  | 0.6727 | 0.8202 |
| No log        | 0.2222 | 22   | 0.7231          | 0.2589  | 0.7231 | 0.8503 |
| No log        | 0.2424 | 24   | 0.6292          | 0.3518  | 0.6292 | 0.7932 |
| No log        | 0.2626 | 26   | 0.5721          | 0.3695  | 0.5721 | 0.7564 |
| No log        | 0.2828 | 28   | 0.6909          | 0.1568  | 0.6909 | 0.8312 |
| No log        | 0.3030 | 30   | 0.6832          | 0.2332  | 0.6832 | 0.8265 |
| No log        | 0.3232 | 32   | 0.6265          | 0.3172  | 0.6265 | 0.7915 |
| No log        | 0.3434 | 34   | 0.5742          | 0.4019  | 0.5742 | 0.7577 |
| No log        | 0.3636 | 36   | 0.5533          | 0.4235  | 0.5533 | 0.7438 |
| No log        | 0.3838 | 38   | 0.5750          | 0.3371  | 0.5750 | 0.7583 |
| No log        | 0.4040 | 40   | 0.5739          | 0.3326  | 0.5739 | 0.7576 |
| No log        | 0.4242 | 42   | 0.5582          | 0.3674  | 0.5582 | 0.7471 |
| No log        | 0.4444 | 44   | 0.5163          | 0.4235  | 0.5163 | 0.7186 |
| No log        | 0.4646 | 46   | 0.5340          | 0.4104  | 0.5340 | 0.7308 |
| No log        | 0.4848 | 48   | 0.6408          | 0.2833  | 0.6408 | 0.8005 |
| No log        | 0.5051 | 50   | 0.6745          | 0.2680  | 0.6745 | 0.8212 |
| No log        | 0.5253 | 52   | 0.6289          | 0.3398  | 0.6289 | 0.7930 |
| No log        | 0.5455 | 54   | 0.5498          | 0.4331  | 0.5498 | 0.7415 |
| No log        | 0.5657 | 56   | 0.5390          | 0.5119  | 0.5390 | 0.7342 |
| No log        | 0.5859 | 58   | 0.5254          | 0.5022  | 0.5254 | 0.7248 |
| No log        | 0.6061 | 60   | 0.5307          | 0.4131  | 0.5307 | 0.7285 |
| No log        | 0.6263 | 62   | 0.5910          | 0.3423  | 0.5910 | 0.7688 |
| No log        | 0.6465 | 64   | 0.7147          | 0.2449  | 0.7147 | 0.8454 |
| No log        | 0.6667 | 66   | 0.7491          | 0.2421  | 0.7491 | 0.8655 |
| No log        | 0.6869 | 68   | 0.6881          | 0.3737  | 0.6881 | 0.8295 |
| No log        | 0.7071 | 70   | 0.6412          | 0.5019  | 0.6412 | 0.8007 |
| No log        | 0.7273 | 72   | 0.5705          | 0.5596  | 0.5705 | 0.7553 |
| No log        | 0.7475 | 74   | 0.4957          | 0.5507  | 0.4957 | 0.7040 |
| No log        | 0.7677 | 76   | 0.4640          | 0.5208  | 0.4640 | 0.6812 |
| No log        | 0.7879 | 78   | 0.4573          | 0.5272  | 0.4573 | 0.6762 |
| No log        | 0.8081 | 80   | 0.4450          | 0.5088  | 0.4450 | 0.6671 |
| No log        | 0.8283 | 82   | 0.4706          | 0.4794  | 0.4706 | 0.6860 |
| No log        | 0.8485 | 84   | 0.5419          | 0.3899  | 0.5419 | 0.7361 |
| No log        | 0.8687 | 86   | 0.6341          | 0.3609  | 0.6341 | 0.7963 |
| No log        | 0.8889 | 88   | 0.5732          | 0.4481  | 0.5732 | 0.7571 |
| No log        | 0.9091 | 90   | 0.5485          | 0.5585  | 0.5485 | 0.7406 |
| No log        | 0.9293 | 92   | 0.4730          | 0.5883  | 0.4730 | 0.6877 |
| No log        | 0.9495 | 94   | 0.4443          | 0.6155  | 0.4443 | 0.6666 |
| No log        | 0.9697 | 96   | 0.4999          | 0.6115  | 0.4999 | 0.7071 |
| No log        | 0.9899 | 98   | 0.4884          | 0.6075  | 0.4884 | 0.6988 |
| No log        | 1.0101 | 100  | 0.5933          | 0.5814  | 0.5933 | 0.7703 |
| No log        | 1.0303 | 102  | 0.6671          | 0.5429  | 0.6671 | 0.8167 |
| No log        | 1.0505 | 104  | 0.6513          | 0.5636  | 0.6513 | 0.8070 |
| No log        | 1.0707 | 106  | 0.5374          | 0.5934  | 0.5374 | 0.7331 |
| No log        | 1.0909 | 108  | 0.5365          | 0.5972  | 0.5365 | 0.7325 |
| No log        | 1.1111 | 110  | 0.5230          | 0.5371  | 0.5230 | 0.7232 |
| No log        | 1.1313 | 112  | 0.5504          | 0.4835  | 0.5504 | 0.7419 |
| No log        | 1.1515 | 114  | 0.5192          | 0.4909  | 0.5192 | 0.7205 |
| No log        | 1.1717 | 116  | 0.5146          | 0.5294  | 0.5146 | 0.7173 |
| No log        | 1.1919 | 118  | 0.4893          | 0.5260  | 0.4893 | 0.6995 |
| No log        | 1.2121 | 120  | 0.4794          | 0.5402  | 0.4794 | 0.6924 |
| No log        | 1.2323 | 122  | 0.4597          | 0.5205  | 0.4597 | 0.6780 |
| No log        | 1.2525 | 124  | 0.4893          | 0.5129  | 0.4893 | 0.6995 |
| No log        | 1.2727 | 126  | 0.5888          | 0.4181  | 0.5888 | 0.7673 |
| No log        | 1.2929 | 128  | 0.5927          | 0.3581  | 0.5927 | 0.7698 |
| No log        | 1.3131 | 130  | 0.5709          | 0.3834  | 0.5709 | 0.7556 |
| No log        | 1.3333 | 132  | 0.5217          | 0.4659  | 0.5217 | 0.7223 |
| No log        | 1.3535 | 134  | 0.4838          | 0.5322  | 0.4838 | 0.6956 |
| No log        | 1.3737 | 136  | 0.5128          | 0.5551  | 0.5128 | 0.7161 |
| No log        | 1.3939 | 138  | 0.6273          | 0.5480  | 0.6273 | 0.7920 |
| No log        | 1.4141 | 140  | 0.6923          | 0.5571  | 0.6923 | 0.8321 |
| No log        | 1.4343 | 142  | 0.6351          | 0.5576  | 0.6351 | 0.7969 |
| No log        | 1.4545 | 144  | 0.6667          | 0.5239  | 0.6667 | 0.8165 |
| No log        | 1.4747 | 146  | 0.7168          | 0.4242  | 0.7168 | 0.8466 |
| No log        | 1.4949 | 148  | 0.6381          | 0.4236  | 0.6381 | 0.7988 |
| No log        | 1.5152 | 150  | 0.6385          | 0.3783  | 0.6385 | 0.7991 |
| No log        | 1.5354 | 152  | 0.6551          | 0.3322  | 0.6551 | 0.8094 |
| No log        | 1.5556 | 154  | 0.6203          | 0.3688  | 0.6203 | 0.7876 |
| No log        | 1.5758 | 156  | 0.5377          | 0.4622  | 0.5377 | 0.7333 |
| No log        | 1.5960 | 158  | 0.6562          | 0.5153  | 0.6562 | 0.8101 |
| No log        | 1.6162 | 160  | 0.9157          | 0.4221  | 0.9157 | 0.9569 |
| No log        | 1.6364 | 162  | 0.9608          | 0.4583  | 0.9608 | 0.9802 |
| No log        | 1.6566 | 164  | 1.0409          | 0.4389  | 1.0409 | 1.0203 |
| No log        | 1.6768 | 166  | 0.9013          | 0.4864  | 0.9013 | 0.9494 |
| No log        | 1.6970 | 168  | 0.6828          | 0.5692  | 0.6828 | 0.8263 |
| No log        | 1.7172 | 170  | 0.5449          | 0.5895  | 0.5449 | 0.7382 |
| No log        | 1.7374 | 172  | 0.5228          | 0.5939  | 0.5228 | 0.7231 |
| No log        | 1.7576 | 174  | 0.4980          | 0.5880  | 0.4980 | 0.7057 |
| No log        | 1.7778 | 176  | 0.4385          | 0.5636  | 0.4385 | 0.6622 |
| No log        | 1.7980 | 178  | 0.4946          | 0.4494  | 0.4946 | 0.7033 |
| No log        | 1.8182 | 180  | 0.7067          | 0.2774  | 0.7067 | 0.8406 |
| No log        | 1.8384 | 182  | 0.9071          | 0.1782  | 0.9071 | 0.9524 |
| No log        | 1.8586 | 184  | 0.9211          | 0.1128  | 0.9211 | 0.9597 |
| No log        | 1.8788 | 186  | 0.8185          | 0.0547  | 0.8185 | 0.9047 |
| No log        | 1.8990 | 188  | 0.6726          | 0.1653  | 0.6726 | 0.8201 |
| No log        | 1.9192 | 190  | 0.5304          | 0.3613  | 0.5304 | 0.7283 |
| No log        | 1.9394 | 192  | 0.4775          | 0.5232  | 0.4775 | 0.6910 |
| No log        | 1.9596 | 194  | 0.6332          | 0.3338  | 0.6332 | 0.7957 |
| No log        | 1.9798 | 196  | 0.7073          | 0.2729  | 0.7073 | 0.8410 |
| No log        | 2.0    | 198  | 0.6526          | 0.3217  | 0.6526 | 0.8078 |
| No log        | 2.0202 | 200  | 0.5171          | 0.4965  | 0.5171 | 0.7191 |
| No log        | 2.0404 | 202  | 0.4123          | 0.6020  | 0.4123 | 0.6421 |
| No log        | 2.0606 | 204  | 0.4445          | 0.4542  | 0.4445 | 0.6667 |
| No log        | 2.0808 | 206  | 0.5704          | 0.4695  | 0.5704 | 0.7553 |
| No log        | 2.1010 | 208  | 0.7118          | 0.4495  | 0.7118 | 0.8437 |
| No log        | 2.1212 | 210  | 0.7145          | 0.4902  | 0.7145 | 0.8453 |
| No log        | 2.1414 | 212  | 0.5935          | 0.6089  | 0.5935 | 0.7704 |
| No log        | 2.1616 | 214  | 0.5078          | 0.6403  | 0.5078 | 0.7126 |
| No log        | 2.1818 | 216  | 0.5159          | 0.6563  | 0.5159 | 0.7182 |
| No log        | 2.2020 | 218  | 0.4520          | 0.6339  | 0.4520 | 0.6723 |
| No log        | 2.2222 | 220  | 0.4077          | 0.6269  | 0.4077 | 0.6385 |
| No log        | 2.2424 | 222  | 0.4984          | 0.5571  | 0.4984 | 0.7060 |
| No log        | 2.2626 | 224  | 0.5830          | 0.4955  | 0.5830 | 0.7635 |
| No log        | 2.2828 | 226  | 0.5514          | 0.4611  | 0.5514 | 0.7425 |
| No log        | 2.3030 | 228  | 0.5456          | 0.4514  | 0.5456 | 0.7387 |
| No log        | 2.3232 | 230  | 0.4845          | 0.5259  | 0.4845 | 0.6961 |
| No log        | 2.3434 | 232  | 0.4351          | 0.5211  | 0.4351 | 0.6596 |
| No log        | 2.3636 | 234  | 0.4361          | 0.5370  | 0.4361 | 0.6604 |
| No log        | 2.3838 | 236  | 0.4190          | 0.5777  | 0.4190 | 0.6473 |
| No log        | 2.4040 | 238  | 0.4437          | 0.5362  | 0.4437 | 0.6661 |
| No log        | 2.4242 | 240  | 0.5630          | 0.5341  | 0.5630 | 0.7503 |
| No log        | 2.4444 | 242  | 0.6251          | 0.5162  | 0.6251 | 0.7906 |
| No log        | 2.4646 | 244  | 0.6731          | 0.5253  | 0.6731 | 0.8204 |
| No log        | 2.4848 | 246  | 0.5797          | 0.5555  | 0.5797 | 0.7614 |
| No log        | 2.5051 | 248  | 0.4386          | 0.6368  | 0.4386 | 0.6622 |
| No log        | 2.5253 | 250  | 0.4205          | 0.6658  | 0.4205 | 0.6484 |
| No log        | 2.5455 | 252  | 0.4122          | 0.6556  | 0.4122 | 0.6420 |
| No log        | 2.5657 | 254  | 0.4206          | 0.6222  | 0.4206 | 0.6486 |
| No log        | 2.5859 | 256  | 0.5747          | 0.5804  | 0.5747 | 0.7581 |
| No log        | 2.6061 | 258  | 0.7764          | 0.4642  | 0.7764 | 0.8811 |
| No log        | 2.6263 | 260  | 0.7382          | 0.4742  | 0.7382 | 0.8592 |
| No log        | 2.6465 | 262  | 0.5983          | 0.4951  | 0.5983 | 0.7735 |
| No log        | 2.6667 | 264  | 0.4368          | 0.5626  | 0.4368 | 0.6609 |
| No log        | 2.6869 | 266  | 0.4332          | 0.5797  | 0.4332 | 0.6582 |
| No log        | 2.7071 | 268  | 0.4273          | 0.5756  | 0.4273 | 0.6537 |
| No log        | 2.7273 | 270  | 0.5086          | 0.5001  | 0.5086 | 0.7132 |
| No log        | 2.7475 | 272  | 0.5652          | 0.4814  | 0.5652 | 0.7518 |
| No log        | 2.7677 | 274  | 0.5486          | 0.4847  | 0.5486 | 0.7407 |
| No log        | 2.7879 | 276  | 0.4850          | 0.5055  | 0.4850 | 0.6964 |
| No log        | 2.8081 | 278  | 0.4041          | 0.6093  | 0.4041 | 0.6357 |
| No log        | 2.8283 | 280  | 0.3914          | 0.6175  | 0.3914 | 0.6257 |
| No log        | 2.8485 | 282  | 0.3953          | 0.6292  | 0.3953 | 0.6287 |
| No log        | 2.8687 | 284  | 0.5139          | 0.5607  | 0.5139 | 0.7169 |
| No log        | 2.8889 | 286  | 0.6306          | 0.5397  | 0.6306 | 0.7941 |
| No log        | 2.9091 | 288  | 0.6704          | 0.5437  | 0.6704 | 0.8188 |
| No log        | 2.9293 | 290  | 0.6487          | 0.5360  | 0.6487 | 0.8054 |
| No log        | 2.9495 | 292  | 0.5224          | 0.5928  | 0.5224 | 0.7228 |
| No log        | 2.9697 | 294  | 0.4781          | 0.6161  | 0.4781 | 0.6915 |
| No log        | 2.9899 | 296  | 0.5448          | 0.5328  | 0.5448 | 0.7381 |
| No log        | 3.0101 | 298  | 0.5716          | 0.4956  | 0.5716 | 0.7561 |
| No log        | 3.0303 | 300  | 0.5130          | 0.4976  | 0.5130 | 0.7163 |
| No log        | 3.0505 | 302  | 0.5222          | 0.4777  | 0.5222 | 0.7226 |
| No log        | 3.0707 | 304  | 0.5823          | 0.4544  | 0.5823 | 0.7631 |
| No log        | 3.0909 | 306  | 0.5318          | 0.5025  | 0.5318 | 0.7293 |
| No log        | 3.1111 | 308  | 0.4175          | 0.5744  | 0.4175 | 0.6461 |
| No log        | 3.1313 | 310  | 0.3828          | 0.5991  | 0.3828 | 0.6187 |
| No log        | 3.1515 | 312  | 0.3858          | 0.6115  | 0.3858 | 0.6212 |
| No log        | 3.1717 | 314  | 0.3837          | 0.6326  | 0.3837 | 0.6194 |
| No log        | 3.1919 | 316  | 0.3843          | 0.6532  | 0.3843 | 0.6199 |
| No log        | 3.2121 | 318  | 0.4211          | 0.5706  | 0.4211 | 0.6489 |
| No log        | 3.2323 | 320  | 0.6338          | 0.5258  | 0.6338 | 0.7961 |
| No log        | 3.2525 | 322  | 0.7523          | 0.4875  | 0.7523 | 0.8673 |
| No log        | 3.2727 | 324  | 0.6793          | 0.4836  | 0.6793 | 0.8242 |
| No log        | 3.2929 | 326  | 0.5077          | 0.5505  | 0.5077 | 0.7125 |
| No log        | 3.3131 | 328  | 0.4411          | 0.5805  | 0.4411 | 0.6641 |
| No log        | 3.3333 | 330  | 0.4461          | 0.5620  | 0.4461 | 0.6679 |
| No log        | 3.3535 | 332  | 0.4497          | 0.5484  | 0.4497 | 0.6706 |
| No log        | 3.3737 | 334  | 0.5390          | 0.5454  | 0.5390 | 0.7342 |
| No log        | 3.3939 | 336  | 0.6627          | 0.5445  | 0.6627 | 0.8141 |
| No log        | 3.4141 | 338  | 0.6997          | 0.5645  | 0.6997 | 0.8365 |
| No log        | 3.4343 | 340  | 0.6324          | 0.5804  | 0.6324 | 0.7952 |
| No log        | 3.4545 | 342  | 0.5730          | 0.6036  | 0.5730 | 0.7570 |
| No log        | 3.4747 | 344  | 0.5281          | 0.6255  | 0.5281 | 0.7267 |
| No log        | 3.4949 | 346  | 0.4826          | 0.6587  | 0.4826 | 0.6947 |
| No log        | 3.5152 | 348  | 0.4558          | 0.6407  | 0.4558 | 0.6751 |
| No log        | 3.5354 | 350  | 0.5483          | 0.5802  | 0.5483 | 0.7405 |
| No log        | 3.5556 | 352  | 0.7012          | 0.5439  | 0.7012 | 0.8374 |
| No log        | 3.5758 | 354  | 0.6683          | 0.5453  | 0.6683 | 0.8175 |
| No log        | 3.5960 | 356  | 0.5045          | 0.5776  | 0.5045 | 0.7103 |
| No log        | 3.6162 | 358  | 0.4279          | 0.5968  | 0.4279 | 0.6541 |
| No log        | 3.6364 | 360  | 0.4173          | 0.5779  | 0.4173 | 0.6460 |
| No log        | 3.6566 | 362  | 0.5126          | 0.5916  | 0.5126 | 0.7160 |
| No log        | 3.6768 | 364  | 0.7432          | 0.5070  | 0.7432 | 0.8621 |
| No log        | 3.6970 | 366  | 0.7413          | 0.5142  | 0.7413 | 0.8610 |
| No log        | 3.7172 | 368  | 0.5595          | 0.5376  | 0.5595 | 0.7480 |
| No log        | 3.7374 | 370  | 0.4554          | 0.5365  | 0.4554 | 0.6748 |
| No log        | 3.7576 | 372  | 0.4155          | 0.5693  | 0.4155 | 0.6446 |
| No log        | 3.7778 | 374  | 0.4052          | 0.6221  | 0.4052 | 0.6365 |
| No log        | 3.7980 | 376  | 0.3938          | 0.6386  | 0.3938 | 0.6275 |
| No log        | 3.8182 | 378  | 0.3988          | 0.6046  | 0.3988 | 0.6315 |
| No log        | 3.8384 | 380  | 0.5153          | 0.5774  | 0.5153 | 0.7179 |
| No log        | 3.8586 | 382  | 0.6085          | 0.5489  | 0.6085 | 0.7801 |
| No log        | 3.8788 | 384  | 0.7164          | 0.5411  | 0.7164 | 0.8464 |
| No log        | 3.8990 | 386  | 0.6803          | 0.5186  | 0.6803 | 0.8248 |
| No log        | 3.9192 | 388  | 0.6235          | 0.5188  | 0.6235 | 0.7896 |
| No log        | 3.9394 | 390  | 0.5010          | 0.5640  | 0.5010 | 0.7078 |
| No log        | 3.9596 | 392  | 0.4012          | 0.6279  | 0.4012 | 0.6334 |
| No log        | 3.9798 | 394  | 0.4024          | 0.6738  | 0.4024 | 0.6343 |
| No log        | 4.0    | 396  | 0.3957          | 0.6689  | 0.3957 | 0.6291 |
| No log        | 4.0202 | 398  | 0.3937          | 0.6219  | 0.3937 | 0.6275 |
| No log        | 4.0404 | 400  | 0.4746          | 0.5574  | 0.4746 | 0.6889 |
| No log        | 4.0606 | 402  | 0.5537          | 0.5130  | 0.5537 | 0.7441 |
| No log        | 4.0808 | 404  | 0.5097          | 0.5569  | 0.5097 | 0.7139 |
| No log        | 4.1010 | 406  | 0.4290          | 0.6113  | 0.4290 | 0.6550 |
| No log        | 4.1212 | 408  | 0.4468          | 0.6207  | 0.4468 | 0.6684 |
| No log        | 4.1414 | 410  | 0.5683          | 0.5570  | 0.5683 | 0.7538 |
| No log        | 4.1616 | 412  | 0.6967          | 0.5373  | 0.6967 | 0.8347 |
| No log        | 4.1818 | 414  | 0.6549          | 0.5198  | 0.6549 | 0.8093 |
| No log        | 4.2020 | 416  | 0.5890          | 0.5640  | 0.5890 | 0.7675 |
| No log        | 4.2222 | 418  | 0.5093          | 0.5807  | 0.5093 | 0.7136 |
| No log        | 4.2424 | 420  | 0.4590          | 0.6166  | 0.4590 | 0.6775 |
| No log        | 4.2626 | 422  | 0.4381          | 0.6547  | 0.4381 | 0.6619 |
| No log        | 4.2828 | 424  | 0.4865          | 0.6325  | 0.4865 | 0.6975 |
| No log        | 4.3030 | 426  | 0.7020          | 0.5403  | 0.7020 | 0.8378 |
| No log        | 4.3232 | 428  | 0.8226          | 0.4940  | 0.8226 | 0.9070 |
| No log        | 4.3434 | 430  | 0.7989          | 0.5047  | 0.7989 | 0.8938 |
| No log        | 4.3636 | 432  | 0.6589          | 0.5190  | 0.6589 | 0.8117 |
| No log        | 4.3838 | 434  | 0.5398          | 0.5569  | 0.5398 | 0.7347 |
| No log        | 4.4040 | 436  | 0.4987          | 0.5374  | 0.4987 | 0.7062 |
| No log        | 4.4242 | 438  | 0.5602          | 0.5124  | 0.5602 | 0.7485 |
| No log        | 4.4444 | 440  | 0.6644          | 0.4928  | 0.6644 | 0.8151 |
| No log        | 4.4646 | 442  | 0.6825          | 0.4928  | 0.6825 | 0.8261 |
| No log        | 4.4848 | 444  | 0.5385          | 0.5496  | 0.5385 | 0.7338 |
| No log        | 4.5051 | 446  | 0.4161          | 0.6074  | 0.4161 | 0.6451 |
| No log        | 4.5253 | 448  | 0.4540          | 0.6028  | 0.4540 | 0.6738 |
| No log        | 4.5455 | 450  | 0.4453          | 0.5870  | 0.4453 | 0.6673 |
| No log        | 4.5657 | 452  | 0.4186          | 0.5888  | 0.4186 | 0.6470 |
| No log        | 4.5859 | 454  | 0.5353          | 0.4886  | 0.5353 | 0.7317 |
| No log        | 4.6061 | 456  | 0.6863          | 0.4699  | 0.6863 | 0.8284 |
| No log        | 4.6263 | 458  | 0.7319          | 0.4628  | 0.7319 | 0.8555 |
| No log        | 4.6465 | 460  | 0.6445          | 0.4558  | 0.6445 | 0.8028 |
| No log        | 4.6667 | 462  | 0.4903          | 0.4908  | 0.4903 | 0.7002 |
| No log        | 4.6869 | 464  | 0.4119          | 0.5848  | 0.4119 | 0.6418 |
| No log        | 4.7071 | 466  | 0.4092          | 0.6340  | 0.4092 | 0.6397 |
| No log        | 4.7273 | 468  | 0.4545          | 0.6240  | 0.4545 | 0.6741 |
| No log        | 4.7475 | 470  | 0.5332          | 0.6392  | 0.5332 | 0.7302 |
| No log        | 4.7677 | 472  | 0.6801          | 0.5491  | 0.6801 | 0.8247 |
| No log        | 4.7879 | 474  | 0.7551          | 0.5393  | 0.7551 | 0.8689 |
| No log        | 4.8081 | 476  | 0.6357          | 0.5539  | 0.6357 | 0.7973 |
| No log        | 4.8283 | 478  | 0.4466          | 0.6553  | 0.4466 | 0.6683 |
| No log        | 4.8485 | 480  | 0.4200          | 0.6928  | 0.4200 | 0.6481 |
| No log        | 4.8687 | 482  | 0.4107          | 0.6719  | 0.4107 | 0.6408 |
| No log        | 4.8889 | 484  | 0.4465          | 0.6082  | 0.4465 | 0.6682 |
| No log        | 4.9091 | 486  | 0.6184          | 0.5220  | 0.6184 | 0.7864 |
| No log        | 4.9293 | 488  | 0.7473          | 0.4747  | 0.7473 | 0.8645 |
| No log        | 4.9495 | 490  | 0.7110          | 0.4797  | 0.7110 | 0.8432 |
| No log        | 4.9697 | 492  | 0.7197          | 0.4793  | 0.7197 | 0.8483 |
| No log        | 4.9899 | 494  | 0.7183          | 0.4885  | 0.7183 | 0.8475 |
| No log        | 5.0101 | 496  | 0.5559          | 0.5454  | 0.5559 | 0.7456 |
| No log        | 5.0303 | 498  | 0.4500          | 0.6346  | 0.4500 | 0.6708 |
| 0.4984        | 5.0505 | 500  | 0.4457          | 0.6332  | 0.4457 | 0.6676 |
| 0.4984        | 5.0707 | 502  | 0.5039          | 0.6049  | 0.5039 | 0.7099 |
| 0.4984        | 5.0909 | 504  | 0.6075          | 0.5791  | 0.6075 | 0.7794 |
| 0.4984        | 5.1111 | 506  | 0.7540          | 0.5272  | 0.7540 | 0.8684 |
| 0.4984        | 5.1313 | 508  | 0.6913          | 0.5311  | 0.6913 | 0.8314 |
| 0.4984        | 5.1515 | 510  | 0.5806          | 0.5631  | 0.5806 | 0.7620 |


### Framework versions

- Transformers 4.44.2
- Pytorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1