ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run3_AugV5_k4_task2_organization

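This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified (the auto-generated card records it as None). It achieves the following results on the evaluation set, which match the last logged evaluation at step 510: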

  • Loss: 1.0995
  • Qwk: 0.2744
  • Mse: 1.0995
  • Rmse: 1.0486
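
The reported loss equals the Mse, which suggests (the card does not confirm it) that the model was trained as a regressor with a mean-squared-error loss. The Rmse is the square root of the Mse: sqrt(1.0995) ≈ 1.0486. The sketch below shows one common way to compute these metrics in plain Python. It assumes that Qwk stands for quadratic weighted kappa and that continuous predictions are rounded and clipped to integer scores before computing it. The function names and the score range are illustrative and are not taken from the training code.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


def to_labels(preds, min_rating, max_rating):
    """Round continuous predictions and clip them to the score range.

    Assumption: the card does not say how predictions were discretized.
    Note that Python's round() rounds exact halves to the nearest even integer.
    """
    return [min(max(int(round(p)), min_rating), max_rating) for p in preds]


def quadratic_weighted_kappa(y_true, y_pred, min_rating, max_rating):
    """Cohen's kappa with quadratic weights for integer ratings."""
    n_cat = max_rating - min_rating + 1
    n = len(y_true)

    # Observed agreement matrix: rows are true ratings, columns are predictions.
    observed = [[0] * n_cat for _ in range(n_cat)]
    for t, p in zip(y_true, y_pred):
        observed[t - min_rating][p - min_rating] += 1

    hist_true = [sum(row) for row in observed]
    hist_pred = [sum(observed[i][j] for i in range(n_cat)) for j in range(n_cat)]

    numerator = 0.0
    denominator = 0.0
    for i in range(n_cat):
        for j in range(n_cat):
            weight = (i - j) ** 2 / (n_cat - 1) ** 2
            numerator += weight * observed[i][j]
            # Expected count if true and predicted ratings were independent.
            denominator += weight * hist_true[i] * hist_pred[j] / n

    if denominator == 0:
        # Both raters used a single identical category; kappa is undefined.
        return float("nan")
    return 1.0 - numerator / denominator
```

A value of 1.0 means perfect agreement, 0 means agreement no better than chance, and negative values mean agreement worse than chance, which is why some early rows of the training log show negative Qwk.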

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
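
As a rough guide to reproducing this setup, the hyperparameters above could be mapped onto keyword arguments for the Transformers TrainingArguments class, as sketched below. The mapping of train_batch_size and eval_batch_size to per-device batch sizes assumes a single device with no gradient accumulation, and the output directory name is hypothetical. The card does not include the original training script, so this is an approximation and not the author's configuration.

```python
# Keyword arguments mirroring the hyperparameters listed in the card.
TRAINING_KWARGS = {
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 8,  # assumption: single device
    "per_device_eval_batch_size": 8,   # assumption: single device
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 100,
}


def build_training_arguments(output_dir="arabert-organization"):
    """Build TrainingArguments from the card's hyperparameters.

    The default output_dir is a hypothetical name. The import is done
    lazily so the settings above can be inspected without Transformers.
    """
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **TRAINING_KWARGS)
```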

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1 2 4.5872 0.0010 4.5872 2.1418
No log 0.2 4 2.6599 -0.0180 2.6599 1.6309
No log 0.3 6 1.9680 -0.0575 1.9680 1.4029
No log 0.4 8 1.6159 -0.0109 1.6159 1.2712
No log 0.5 10 1.7703 -0.1219 1.7703 1.3305
No log 0.6 12 1.6657 -0.1352 1.6657 1.2906
No log 0.7 14 1.4222 -0.1340 1.4221 1.1925
No log 0.8 16 1.2857 0.1110 1.2857 1.1339
No log 0.9 18 1.3680 0.0299 1.3680 1.1696
No log 1.0 20 1.5508 0.0512 1.5508 1.2453
No log 1.1 22 1.3578 0.1959 1.3578 1.1652
No log 1.2 24 1.1695 0.2478 1.1695 1.0814
No log 1.3 26 1.1631 0.1984 1.1631 1.0785
No log 1.4 28 1.1397 0.2532 1.1397 1.0676
No log 1.5 30 1.1624 0.1138 1.1624 1.0781
No log 1.6 32 1.1969 0.1448 1.1969 1.0940
No log 1.7 34 1.2772 0.1619 1.2772 1.1301
No log 1.8 36 1.4957 0.1756 1.4957 1.2230
No log 1.9 38 2.1763 0.1018 2.1763 1.4752
No log 2.0 40 2.7098 0.0618 2.7098 1.6462
No log 2.1 42 2.6351 0.0657 2.6351 1.6233
No log 2.2 44 1.9999 0.1145 1.9999 1.4142
No log 2.3 46 1.3115 0.1371 1.3115 1.1452
No log 2.4 48 1.0823 0.2939 1.0823 1.0403
No log 2.5 50 1.1412 0.1722 1.1412 1.0682
No log 2.6 52 1.1275 0.3478 1.1275 1.0618
No log 2.7 54 1.1382 0.1975 1.1382 1.0669
No log 2.8 56 1.4857 0.1742 1.4857 1.2189
No log 2.9 58 1.6993 0.2315 1.6993 1.3036
No log 3.0 60 2.0233 0.1485 2.0233 1.4224
No log 3.1 62 2.0422 0.1485 2.0422 1.4290
No log 3.2 64 1.6632 0.2048 1.6632 1.2897
No log 3.3 66 1.4054 0.2115 1.4054 1.1855
No log 3.4 68 1.3484 0.1979 1.3484 1.1612
No log 3.5 70 1.2882 0.2676 1.2882 1.1350
No log 3.6 72 1.1714 0.3005 1.1714 1.0823
No log 3.7 74 1.1475 0.2718 1.1475 1.0712
No log 3.8 76 1.0245 0.3199 1.0245 1.0122
No log 3.9 78 1.0275 0.3199 1.0275 1.0136
No log 4.0 80 1.1626 0.2772 1.1626 1.0782
No log 4.1 82 1.1937 0.2807 1.1937 1.0925
No log 4.2 84 1.0885 0.2605 1.0885 1.0433
No log 4.3 86 1.0476 0.3639 1.0476 1.0235
No log 4.4 88 1.0670 0.2939 1.0670 1.0329
No log 4.5 90 1.0514 0.3641 1.0514 1.0254
No log 4.6 92 1.0841 0.3059 1.0841 1.0412
No log 4.7 94 1.2652 0.2280 1.2652 1.1248
No log 4.8 96 1.2984 0.1972 1.2984 1.1395
No log 4.9 98 1.1641 0.2374 1.1641 1.0789
No log 5.0 100 1.1193 0.2088 1.1193 1.0580
No log 5.1 102 1.1156 0.2410 1.1156 1.0562
No log 5.2 104 1.0678 0.3544 1.0678 1.0333
No log 5.3 106 1.1066 0.3399 1.1066 1.0519
No log 5.4 108 1.0751 0.4024 1.0751 1.0369
No log 5.5 110 0.9690 0.5555 0.9690 0.9844
No log 5.6 112 1.0326 0.4782 1.0326 1.0162
No log 5.7 114 0.9950 0.4792 0.9950 0.9975
No log 5.8 116 0.9359 0.6002 0.9359 0.9674
No log 5.9 118 0.9592 0.4582 0.9592 0.9794
No log 6.0 120 0.9970 0.4364 0.9970 0.9985
No log 6.1 122 1.0605 0.3213 1.0605 1.0298
No log 6.2 124 1.0885 0.3008 1.0885 1.0433
No log 6.3 126 0.9976 0.3433 0.9976 0.9988
No log 6.4 128 0.9899 0.3862 0.9899 0.9950
No log 6.5 130 1.0026 0.3678 1.0026 1.0013
No log 6.6 132 0.9759 0.4294 0.9759 0.9879
No log 6.7 134 1.0098 0.4556 1.0098 1.0049
No log 6.8 136 1.1593 0.3332 1.1593 1.0767
No log 6.9 138 1.3276 0.3448 1.3276 1.1522
No log 7.0 140 1.2727 0.3177 1.2727 1.1281
No log 7.1 142 1.1580 0.4004 1.1580 1.0761
No log 7.2 144 1.1342 0.4172 1.1342 1.0650
No log 7.3 146 1.0789 0.4120 1.0789 1.0387
No log 7.4 148 1.0561 0.3773 1.0561 1.0276
No log 7.5 150 1.0697 0.3397 1.0697 1.0343
No log 7.6 152 1.1266 0.3207 1.1266 1.0614
No log 7.7 154 1.0798 0.3843 1.0798 1.0392
No log 7.8 156 1.0124 0.4082 1.0124 1.0062
No log 7.9 158 0.9506 0.3554 0.9506 0.9750
No log 8.0 160 0.9644 0.4091 0.9644 0.9821
No log 8.1 162 0.9830 0.4337 0.9830 0.9915
No log 8.2 164 0.9628 0.4750 0.9628 0.9812
No log 8.3 166 1.1153 0.4731 1.1153 1.0561
No log 8.4 168 1.1955 0.4705 1.1955 1.0934
No log 8.5 170 1.0437 0.5244 1.0437 1.0216
No log 8.6 172 0.9805 0.4430 0.9805 0.9902
No log 8.7 174 0.9977 0.3980 0.9977 0.9989
No log 8.8 176 1.1254 0.3949 1.1254 1.0608
No log 8.9 178 1.1233 0.3719 1.1233 1.0599
No log 9.0 180 1.1177 0.4086 1.1177 1.0572
No log 9.1 182 1.0827 0.3546 1.0827 1.0405
No log 9.2 184 1.0326 0.4080 1.0326 1.0162
No log 9.3 186 1.0313 0.4080 1.0313 1.0156
No log 9.4 188 1.1440 0.3361 1.1440 1.0696
No log 9.5 190 1.2092 0.3560 1.2092 1.0997
No log 9.6 192 1.3465 0.2679 1.3465 1.1604
No log 9.7 194 1.4593 0.3101 1.4593 1.2080
No log 9.8 196 1.2515 0.3129 1.2515 1.1187
No log 9.9 198 1.0267 0.4191 1.0267 1.0133
No log 10.0 200 0.9745 0.4964 0.9745 0.9872
No log 10.1 202 0.9705 0.5149 0.9705 0.9851
No log 10.2 204 1.0261 0.4556 1.0261 1.0130
No log 10.3 206 1.0188 0.4119 1.0188 1.0094
No log 10.4 208 0.9837 0.4364 0.9837 0.9918
No log 10.5 210 0.9920 0.4607 0.9920 0.9960
No log 10.6 212 0.9989 0.4607 0.9989 0.9994
No log 10.7 214 1.0037 0.4640 1.0037 1.0019
No log 10.8 216 1.0229 0.4335 1.0229 1.0114
No log 10.9 218 1.1380 0.3475 1.1380 1.0668
No log 11.0 220 1.2256 0.3106 1.2256 1.1071
No log 11.1 222 1.1489 0.3339 1.1489 1.0719
No log 11.2 224 1.0288 0.4483 1.0288 1.0143
No log 11.3 226 1.0059 0.3908 1.0059 1.0029
No log 11.4 228 1.0196 0.3787 1.0196 1.0098
No log 11.5 230 0.9990 0.3711 0.9990 0.9995
No log 11.6 232 1.0384 0.3433 1.0384 1.0190
No log 11.7 234 1.1469 0.3267 1.1469 1.0709
No log 11.8 236 1.1143 0.3613 1.1143 1.0556
No log 11.9 238 1.0433 0.3781 1.0433 1.0214
No log 12.0 240 1.0320 0.3304 1.0320 1.0159
No log 12.1 242 1.0391 0.3304 1.0391 1.0193
No log 12.2 244 1.0612 0.3865 1.0612 1.0302
No log 12.3 246 1.1323 0.3487 1.1323 1.0641
No log 12.4 248 1.2174 0.3249 1.2174 1.1034
No log 12.5 250 1.2089 0.3249 1.2089 1.0995
No log 12.6 252 1.0692 0.3397 1.0692 1.0340
No log 12.7 254 1.0140 0.2803 1.0140 1.0070
No log 12.8 256 1.0217 0.3256 1.0217 1.0108
No log 12.9 258 1.0709 0.2700 1.0709 1.0348
No log 13.0 260 1.2172 0.2870 1.2172 1.1033
No log 13.1 262 1.3855 0.2885 1.3855 1.1771
No log 13.2 264 1.3572 0.2981 1.3572 1.1650
No log 13.3 266 1.3152 0.2863 1.3152 1.1468
No log 13.4 268 1.1878 0.3500 1.1878 1.0899
No log 13.5 270 1.0946 0.3490 1.0946 1.0462
No log 13.6 272 1.0783 0.3625 1.0783 1.0384
No log 13.7 274 1.0377 0.3268 1.0377 1.0187
No log 13.8 276 1.0262 0.3174 1.0262 1.0130
No log 13.9 278 1.0267 0.3542 1.0267 1.0133
No log 14.0 280 1.0288 0.3719 1.0288 1.0143
No log 14.1 282 1.0036 0.3180 1.0036 1.0018
No log 14.2 284 1.0003 0.3180 1.0003 1.0002
No log 14.3 286 1.0142 0.4004 1.0142 1.0071
No log 14.4 288 1.0101 0.4331 1.0101 1.0050
No log 14.5 290 1.0039 0.3796 1.0039 1.0020
No log 14.6 292 1.0212 0.4268 1.0212 1.0105
No log 14.7 294 1.0459 0.3902 1.0459 1.0227
No log 14.8 296 1.0892 0.3494 1.0892 1.0436
No log 14.9 298 1.1070 0.3887 1.1070 1.0521
No log 15.0 300 1.1000 0.4156 1.1000 1.0488
No log 15.1 302 1.0504 0.3952 1.0504 1.0249
No log 15.2 304 1.0500 0.3678 1.0500 1.0247
No log 15.3 306 1.0598 0.3262 1.0598 1.0295
No log 15.4 308 1.0284 0.3869 1.0284 1.0141
No log 15.5 310 1.0142 0.3525 1.0142 1.0071
No log 15.6 312 1.0146 0.3618 1.0146 1.0073
No log 15.7 314 1.0680 0.3865 1.0680 1.0334
No log 15.8 316 1.0728 0.4211 1.0728 1.0358
No log 15.9 318 1.0858 0.4068 1.0858 1.0420
No log 16.0 320 1.0736 0.3453 1.0736 1.0361
No log 16.1 322 1.0822 0.3453 1.0822 1.0403
No log 16.2 324 1.0560 0.3169 1.0560 1.0276
No log 16.3 326 1.0169 0.3154 1.0169 1.0084
No log 16.4 328 0.9961 0.3108 0.9961 0.9980
No log 16.5 330 0.9976 0.3013 0.9976 0.9988
No log 16.6 332 1.0734 0.4259 1.0734 1.0361
No log 16.7 334 1.1396 0.3998 1.1396 1.0675
No log 16.8 336 1.0824 0.4426 1.0824 1.0404
No log 16.9 338 0.9637 0.3607 0.9637 0.9817
No log 17.0 340 0.9438 0.4288 0.9438 0.9715
No log 17.1 342 0.9544 0.3960 0.9544 0.9769
No log 17.2 344 1.0187 0.4426 1.0187 1.0093
No log 17.3 346 1.0288 0.4224 1.0288 1.0143
No log 17.4 348 0.9648 0.3827 0.9648 0.9823
No log 17.5 350 0.9479 0.4385 0.9479 0.9736
No log 17.6 352 0.9535 0.3796 0.9535 0.9765
No log 17.7 354 0.9825 0.3532 0.9825 0.9912
No log 17.8 356 0.9960 0.3519 0.9960 0.9980
No log 17.9 358 0.9717 0.4233 0.9717 0.9857
No log 18.0 360 0.9623 0.4066 0.9623 0.9810
No log 18.1 362 0.9756 0.4007 0.9756 0.9877
No log 18.2 364 0.9793 0.3617 0.9793 0.9896
No log 18.3 366 1.0158 0.4745 1.0158 1.0079
No log 18.4 368 1.0678 0.3883 1.0678 1.0334
No log 18.5 370 1.1074 0.3883 1.1074 1.0523
No log 18.6 372 1.0381 0.4139 1.0381 1.0189
No log 18.7 374 0.9651 0.4070 0.9651 0.9824
No log 18.8 376 0.9440 0.3678 0.9440 0.9716
No log 18.9 378 0.9647 0.3628 0.9647 0.9822
No log 19.0 380 1.0577 0.4336 1.0577 1.0285
No log 19.1 382 1.1250 0.4736 1.1250 1.0606
No log 19.2 384 1.0808 0.4341 1.0808 1.0396
No log 19.3 386 1.0569 0.3624 1.0569 1.0281
No log 19.4 388 1.1218 0.3534 1.1218 1.0592
No log 19.5 390 1.1432 0.3137 1.1432 1.0692
No log 19.6 392 1.1577 0.3117 1.1577 1.0760
No log 19.7 394 1.1365 0.2916 1.1365 1.0661
No log 19.8 396 1.1362 0.3298 1.1362 1.0659
No log 19.9 398 1.1135 0.3883 1.1135 1.0552
No log 20.0 400 1.1281 0.3883 1.1281 1.0621
No log 20.1 402 1.1681 0.3429 1.1681 1.0808
No log 20.2 404 1.1144 0.3595 1.1144 1.0556
No log 20.3 406 1.0175 0.3865 1.0175 1.0087
No log 20.4 408 0.9912 0.3522 0.9912 0.9956
No log 20.5 410 1.0106 0.3664 1.0106 1.0053
No log 20.6 412 0.9882 0.3268 0.9882 0.9941
No log 20.7 414 0.9916 0.3965 0.9916 0.9958
No log 20.8 416 1.0068 0.3412 1.0068 1.0034
No log 20.9 418 1.0150 0.3412 1.0150 1.0075
No log 21.0 420 0.9918 0.3796 0.9918 0.9959
No log 21.1 422 0.9996 0.3268 0.9996 0.9998
No log 21.2 424 1.0016 0.3268 1.0016 1.0008
No log 21.3 426 0.9965 0.3891 0.9965 0.9982
No log 21.4 428 1.0185 0.3913 1.0185 1.0092
No log 21.5 430 1.0439 0.3584 1.0439 1.0217
No log 21.6 432 1.0964 0.3532 1.0964 1.0471
No log 21.7 434 1.1146 0.3805 1.1146 1.0558
No log 21.8 436 1.0771 0.3312 1.0771 1.0378
No log 21.9 438 1.0597 0.2588 1.0597 1.0294
No log 22.0 440 1.0688 0.2327 1.0688 1.0338
No log 22.1 442 1.0840 0.2431 1.0840 1.0412
No log 22.2 444 1.1397 0.2871 1.1397 1.0675
No log 22.3 446 1.2282 0.2825 1.2282 1.1082
No log 22.4 448 1.3498 0.2815 1.3498 1.1618
No log 22.5 450 1.3392 0.3370 1.3392 1.1572
No log 22.6 452 1.2186 0.3157 1.2186 1.1039
No log 22.7 454 1.1342 0.2902 1.1342 1.0650
No log 22.8 456 1.0999 0.2777 1.0999 1.0487
No log 22.9 458 1.0907 0.2911 1.0907 1.0444
No log 23.0 460 1.0850 0.2777 1.0850 1.0416
No log 23.1 462 1.0952 0.2650 1.0952 1.0465
No log 23.2 464 1.1203 0.2685 1.1203 1.0584
No log 23.3 466 1.1053 0.3010 1.1053 1.0513
No log 23.4 468 1.0794 0.2824 1.0794 1.0389
No log 23.5 470 1.0800 0.3126 1.0800 1.0392
No log 23.6 472 1.0970 0.3126 1.0970 1.0474
No log 23.7 474 1.0667 0.3218 1.0667 1.0328
No log 23.8 476 1.0572 0.2786 1.0572 1.0282
No log 23.9 478 1.0392 0.3016 1.0392 1.0194
No log 24.0 480 1.0376 0.3105 1.0376 1.0186
No log 24.1 482 1.0379 0.3062 1.0379 1.0188
No log 24.2 484 1.0445 0.3522 1.0445 1.0220
No log 24.3 486 1.0445 0.3016 1.0445 1.0220
No log 24.4 488 1.0587 0.3527 1.0587 1.0289
No log 24.5 490 1.0866 0.3745 1.0866 1.0424
No log 24.6 492 1.1306 0.3115 1.1306 1.0633
No log 24.7 494 1.1325 0.3115 1.1325 1.0642
No log 24.8 496 1.0910 0.3297 1.0910 1.0445
No log 24.9 498 1.0624 0.3194 1.0624 1.0307
0.32 25.0 500 1.0483 0.4066 1.0483 1.0238
0.32 25.1 502 1.0600 0.3476 1.0600 1.0296
0.32 25.2 504 1.0816 0.2624 1.0816 1.0400
0.32 25.3 506 1.0784 0.2624 1.0784 1.0384
0.32 25.4 508 1.0779 0.2777 1.0779 1.0382
0.32 25.5 510 1.0995 0.2744 1.0995 1.0486
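
Each row of the log above has seven fields, but the training-loss field reads "No log" (two words) until a training loss first appears at step 500, so a naive whitespace split misaligns the columns. The sketch below parses rows from the right to avoid that, picks the row with the lowest validation loss, and derives the steps per epoch. It uses a small sample of rows copied from the log, not the full table. The estimate of the training-set size assumes a single device, no gradient accumulation, and a kept final partial batch, none of which the card confirms. The helper names are illustrative.

```python
SAMPLE_ROWS = [
    "No log 0.1 2 4.5872 0.0010 4.5872 2.1418",
    "No log 5.5 110 0.9690 0.5555 0.9690 0.9844",
    "No log 5.8 116 0.9359 0.6002 0.9359 0.9674",
    "0.32 25.0 500 1.0483 0.4066 1.0483 1.0238",
    "0.32 25.5 510 1.0995 0.2744 1.0995 1.0486",
]


def parse_row(line):
    """Parse one log row, reading the six numeric fields from the right."""
    tokens = line.split()
    epoch, step, val_loss, qwk, mse, rmse = tokens[-6:]
    train_loss_text = " ".join(tokens[:-6])
    return {
        # "No log" means no training loss had been recorded yet.
        "train_loss": None if train_loss_text == "No log" else float(train_loss_text),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


rows = [parse_row(line) for line in SAMPLE_ROWS]

# Row with the lowest validation loss within the sample.
best = min(rows, key=lambda r: r["val_loss"])

# Optimizer steps per epoch, derived from the last logged row.
last = rows[-1]
steps_per_epoch = round(last["step"] / last["epoch"])

# Implied training-set size range for batch size 8 (see assumptions above).
batch_size = 8
min_examples = (steps_per_epoch - 1) * batch_size + 1
max_examples = steps_per_epoch * batch_size
```

The log ends at epoch 25.5 (step 510) even though num_epochs is listed as 100; the card does not say why training stopped there.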

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size (Safetensors): 0.1B params; tensor type: F32