ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k7_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8032
  • Qwk: 0.4052
  • Mse: 0.8032
  • Rmse: 0.8962

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1176 2 2.5757 -0.0924 2.5757 1.6049
No log 0.2353 4 1.3587 0.0994 1.3587 1.1656
No log 0.3529 6 1.1844 -0.2292 1.1844 1.0883
No log 0.4706 8 0.9977 -0.0426 0.9977 0.9988
No log 0.5882 10 0.9364 0.1007 0.9364 0.9677
No log 0.7059 12 0.8715 0.1648 0.8715 0.9335
No log 0.8235 14 0.8307 -0.0103 0.8307 0.9114
No log 0.9412 16 0.8269 -0.0483 0.8269 0.9094
No log 1.0588 18 0.9257 0.0495 0.9257 0.9621
No log 1.1765 20 0.9539 -0.0700 0.9539 0.9767
No log 1.2941 22 0.8720 0.0027 0.8720 0.9338
No log 1.4118 24 0.8314 0.0 0.8314 0.9118
No log 1.5294 26 0.8248 0.0 0.8248 0.9082
No log 1.6471 28 0.8307 0.1236 0.8307 0.9114
No log 1.7647 30 0.7798 0.0 0.7798 0.8831
No log 1.8824 32 0.7549 0.0 0.7549 0.8688
No log 2.0 34 0.7683 0.0 0.7683 0.8765
No log 2.1176 36 0.8061 0.0481 0.8061 0.8978
No log 2.2353 38 0.9177 0.2526 0.9177 0.9579
No log 2.3529 40 0.9241 0.3444 0.9241 0.9613
No log 2.4706 42 0.8782 0.3173 0.8782 0.9371
No log 2.5882 44 0.7992 0.1372 0.7992 0.8940
No log 2.7059 46 0.7477 0.0937 0.7477 0.8647
No log 2.8235 48 0.7057 0.0428 0.7057 0.8400
No log 2.9412 50 0.7487 0.3243 0.7487 0.8653
No log 3.0588 52 0.8144 0.1648 0.8144 0.9025
No log 3.1765 54 0.8334 0.1699 0.8334 0.9129
No log 3.2941 56 0.8386 0.1094 0.8386 0.9157
No log 3.4118 58 0.8747 -0.0027 0.8747 0.9353
No log 3.5294 60 0.9345 -0.1275 0.9345 0.9667
No log 3.6471 62 0.8860 -0.0444 0.8860 0.9413
No log 3.7647 64 0.7937 0.0 0.7937 0.8909
No log 3.8824 66 0.7158 0.0889 0.7158 0.8460
No log 4.0 68 0.7009 0.0393 0.7009 0.8372
No log 4.1176 70 0.7320 0.0359 0.7320 0.8556
No log 4.2353 72 0.7952 -0.0051 0.7952 0.8917
No log 4.3529 74 0.8104 0.0265 0.8104 0.9002
No log 4.4706 76 0.8378 0.0927 0.8378 0.9153
No log 4.5882 78 0.8915 0.0966 0.8915 0.9442
No log 4.7059 80 0.9184 0.1699 0.9184 0.9583
No log 4.8235 82 0.9273 0.2171 0.9273 0.9630
No log 4.9412 84 0.9054 0.1972 0.9054 0.9515
No log 5.0588 86 0.9133 0.0245 0.9133 0.9557
No log 5.1765 88 0.8967 0.0968 0.8967 0.9469
No log 5.2941 90 0.9317 0.1303 0.9317 0.9652
No log 5.4118 92 0.9621 0.2063 0.9621 0.9809
No log 5.5294 94 0.9345 0.2632 0.9345 0.9667
No log 5.6471 96 0.8433 0.3238 0.8433 0.9183
No log 5.7647 98 0.8241 0.2007 0.8241 0.9078
No log 5.8824 100 0.8276 -0.0070 0.8276 0.9097
No log 6.0 102 0.8620 0.0362 0.8620 0.9284
No log 6.1176 104 0.8316 0.0697 0.8316 0.9119
No log 6.2353 106 0.8264 0.2345 0.8264 0.9091
No log 6.3529 108 0.8756 0.2604 0.8756 0.9357
No log 6.4706 110 0.8394 0.2171 0.8394 0.9162
No log 6.5882 112 0.8391 0.2063 0.8391 0.9160
No log 6.7059 114 0.8412 0.0 0.8412 0.9172
No log 6.8235 116 0.7238 0.1829 0.7238 0.8508
No log 6.9412 118 0.6557 0.2819 0.6557 0.8098
No log 7.0588 120 0.6879 0.3950 0.6879 0.8294
No log 7.1765 122 0.8563 0.3499 0.8563 0.9254
No log 7.2941 124 0.9871 0.2921 0.9871 0.9935
No log 7.4118 126 0.9991 0.2464 0.9991 0.9995
No log 7.5294 128 1.0648 0.1354 1.0648 1.0319
No log 7.6471 130 1.5336 0.1007 1.5336 1.2384
No log 7.7647 132 1.5717 0.0790 1.5717 1.2537
No log 7.8824 134 1.2385 0.1332 1.2385 1.1129
No log 8.0 136 0.9813 0.0801 0.9813 0.9906
No log 8.1176 138 0.9364 0.2832 0.9364 0.9677
No log 8.2353 140 0.8927 0.3183 0.8927 0.9448
No log 8.3529 142 0.8519 0.3221 0.8519 0.9230
No log 8.4706 144 0.8226 0.2414 0.8226 0.9069
No log 8.5882 146 0.8010 0.2813 0.8010 0.8950
No log 8.7059 148 0.7961 0.2784 0.7961 0.8922
No log 8.8235 150 0.8075 0.3372 0.8075 0.8986
No log 8.9412 152 0.8405 0.3699 0.8405 0.9168
No log 9.0588 154 0.7972 0.3637 0.7972 0.8928
No log 9.1765 156 0.7613 0.3099 0.7613 0.8725
No log 9.2941 158 0.7245 0.1699 0.7245 0.8512
No log 9.4118 160 0.7186 0.1807 0.7186 0.8477
No log 9.5294 162 0.7385 0.1268 0.7385 0.8593
No log 9.6471 164 0.7850 0.2171 0.7850 0.8860
No log 9.7647 166 0.8374 0.2328 0.8374 0.9151
No log 9.8824 168 0.8369 0.1995 0.8369 0.9148
No log 10.0 170 0.8049 0.2589 0.8049 0.8971
No log 10.1176 172 0.8180 0.0652 0.8180 0.9044
No log 10.2353 174 0.8234 0.0652 0.8234 0.9074
No log 10.3529 176 0.7990 0.2027 0.7990 0.8938
No log 10.4706 178 0.7999 0.3372 0.7999 0.8943
No log 10.5882 180 0.7837 0.3564 0.7837 0.8853
No log 10.7059 182 0.6983 0.3032 0.6983 0.8356
No log 10.8235 184 0.6769 0.3425 0.6769 0.8228
No log 10.9412 186 0.7105 0.3127 0.7105 0.8429
No log 11.0588 188 0.7800 0.3372 0.7800 0.8832
No log 11.1765 190 0.8865 0.3782 0.8865 0.9415
No log 11.2941 192 0.8579 0.4251 0.8579 0.9262
No log 11.4118 194 0.7613 0.2784 0.7613 0.8725
No log 11.5294 196 0.7461 0.2683 0.7461 0.8637
No log 11.6471 198 0.7541 0.2683 0.7541 0.8684
No log 11.7647 200 0.7541 0.2319 0.7541 0.8684
No log 11.8824 202 0.8252 0.4089 0.8252 0.9084
No log 12.0 204 0.8374 0.3590 0.8374 0.9151
No log 12.1176 206 0.8097 0.1264 0.8097 0.8998
No log 12.2353 208 0.8086 0.1723 0.8086 0.8992
No log 12.3529 210 0.8116 0.1051 0.8116 0.9009
No log 12.4706 212 0.7856 0.2158 0.7856 0.8863
No log 12.5882 214 0.7473 0.2319 0.7473 0.8645
No log 12.7059 216 0.7784 0.3894 0.7784 0.8823
No log 12.8235 218 0.7992 0.4014 0.7992 0.8940
No log 12.9412 220 0.7434 0.4076 0.7434 0.8622
No log 13.0588 222 0.7099 0.3545 0.7099 0.8425
No log 13.1765 224 0.6986 0.3622 0.6986 0.8358
No log 13.2941 226 0.7285 0.3545 0.7285 0.8535
No log 13.4118 228 0.8145 0.3372 0.8145 0.9025
No log 13.5294 230 0.9086 0.3131 0.9086 0.9532
No log 13.6471 232 0.9034 0.3195 0.9034 0.9505
No log 13.7647 234 0.8619 0.2847 0.8619 0.9284
No log 13.8824 236 0.8313 0.3737 0.8313 0.9118
No log 14.0 238 0.8160 0.3918 0.8160 0.9033
No log 14.1176 240 0.7708 0.3996 0.7708 0.8780
No log 14.2353 242 0.7486 0.3996 0.7486 0.8652
No log 14.3529 244 0.7968 0.3302 0.7968 0.8926
No log 14.4706 246 0.8710 0.4175 0.8710 0.9333
No log 14.5882 248 0.9417 0.3761 0.9417 0.9704
No log 14.7059 250 0.8782 0.3538 0.8782 0.9371
No log 14.8235 252 0.7827 0.3494 0.7827 0.8847
No log 14.9412 254 0.7067 0.3518 0.7067 0.8406
No log 15.0588 256 0.6833 0.3701 0.6833 0.8266
No log 15.1765 258 0.6791 0.4001 0.6791 0.8241
No log 15.2941 260 0.6811 0.3701 0.6811 0.8253
No log 15.4118 262 0.7124 0.3868 0.7124 0.8440
No log 15.5294 264 0.7480 0.3789 0.7480 0.8649
No log 15.6471 266 0.8122 0.3746 0.8122 0.9012
No log 15.7647 268 0.8056 0.3494 0.8056 0.8975
No log 15.8824 270 0.7901 0.3894 0.7901 0.8888
No log 16.0 272 0.7528 0.4134 0.7528 0.8677
No log 16.1176 274 0.7109 0.3238 0.7109 0.8432
No log 16.2353 276 0.7114 0.3238 0.7114 0.8435
No log 16.3529 278 0.7378 0.3518 0.7378 0.8589
No log 16.4706 280 0.7623 0.3712 0.7623 0.8731
No log 16.5882 282 0.7732 0.3712 0.7732 0.8793
No log 16.7059 284 0.7770 0.3637 0.7770 0.8815
No log 16.8235 286 0.8168 0.3746 0.8168 0.9038
No log 16.9412 288 0.7929 0.3819 0.7929 0.8904
No log 17.0588 290 0.7471 0.3712 0.7471 0.8644
No log 17.1765 292 0.7909 0.3444 0.7909 0.8893
No log 17.2941 294 0.8272 0.2847 0.8272 0.9095
No log 17.4118 296 0.8620 0.2847 0.8620 0.9284
No log 17.5294 298 0.9036 0.2498 0.9036 0.9506
No log 17.6471 300 0.9400 0.2204 0.9400 0.9696
No log 17.7647 302 0.9759 0.1495 0.9759 0.9879
No log 17.8824 304 1.0037 0.1260 1.0037 1.0018
No log 18.0 306 0.9694 0.1636 0.9694 0.9846
No log 18.1176 308 0.8644 0.2751 0.8644 0.9297
No log 18.2353 310 0.7909 0.3253 0.7909 0.8893
No log 18.3529 312 0.7524 0.3840 0.7524 0.8674
No log 18.4706 314 0.7711 0.4597 0.7711 0.8781
No log 18.5882 316 0.7910 0.4597 0.7910 0.8894
No log 18.7059 318 0.7988 0.4684 0.7988 0.8938
No log 18.8235 320 0.7770 0.4182 0.7770 0.8815
No log 18.9412 322 0.7755 0.3123 0.7755 0.8806
No log 19.0588 324 0.7761 0.2685 0.7761 0.8810
No log 19.1765 326 0.7805 0.2685 0.7805 0.8835
No log 19.2941 328 0.8082 0.3518 0.8082 0.8990
No log 19.4118 330 0.8000 0.3518 0.8000 0.8944
No log 19.5294 332 0.8343 0.2662 0.8343 0.9134
No log 19.6471 334 0.8458 0.3261 0.8458 0.9197
No log 19.7647 336 0.8213 0.2950 0.8213 0.9063
No log 19.8824 338 0.7959 0.2685 0.7959 0.8921
No log 20.0 340 0.7812 0.2685 0.7812 0.8839
No log 20.1176 342 0.7656 0.2685 0.7656 0.8750
No log 20.2353 344 0.7817 0.2981 0.7817 0.8841
No log 20.3529 346 0.8332 0.3372 0.8332 0.9128
No log 20.4706 348 0.9192 0.2754 0.9192 0.9588
No log 20.5882 350 0.9276 0.3256 0.9276 0.9631
No log 20.7059 352 0.8733 0.3519 0.8733 0.9345
No log 20.8235 354 0.8068 0.3444 0.8068 0.8982
No log 20.9412 356 0.7996 0.3444 0.7996 0.8942
No log 21.0588 358 0.8035 0.3770 0.8035 0.8964
No log 21.1765 360 0.7836 0.4014 0.7836 0.8852
No log 21.2941 362 0.7520 0.3637 0.7520 0.8672
No log 21.4118 364 0.7257 0.4479 0.7257 0.8519
No log 21.5294 366 0.6960 0.4479 0.6960 0.8343
No log 21.6471 368 0.6806 0.4753 0.6806 0.8250
No log 21.7647 370 0.6986 0.4753 0.6986 0.8358
No log 21.8824 372 0.7108 0.4753 0.7108 0.8431
No log 22.0 374 0.6759 0.4753 0.6759 0.8221
No log 22.1176 376 0.6589 0.4330 0.6589 0.8117
No log 22.2353 378 0.6724 0.4330 0.6724 0.8200
No log 22.3529 380 0.7196 0.4243 0.7196 0.8483
No log 22.4706 382 0.7814 0.3662 0.7814 0.8840
No log 22.5882 384 0.7966 0.4076 0.7966 0.8925
No log 22.7059 386 0.7618 0.3814 0.7618 0.8728
No log 22.8235 388 0.7042 0.4330 0.7042 0.8391
No log 22.9412 390 0.6606 0.3782 0.6606 0.8128
No log 23.0588 392 0.6441 0.3546 0.6441 0.8025
No log 23.1765 394 0.6713 0.3839 0.6713 0.8193
No log 23.2941 396 0.7153 0.4219 0.7153 0.8457
No log 23.4118 398 0.7572 0.4479 0.7572 0.8702
No log 23.5294 400 0.7462 0.4479 0.7462 0.8638
No log 23.6471 402 0.7240 0.4479 0.7240 0.8509
No log 23.7647 404 0.7425 0.4479 0.7425 0.8617
No log 23.8824 406 0.7640 0.4076 0.7640 0.8740
No log 24.0 408 0.8046 0.3399 0.8046 0.8970
No log 24.1176 410 0.8380 0.3221 0.8380 0.9154
No log 24.2353 412 0.8297 0.2558 0.8297 0.9109
No log 24.3529 414 0.8118 0.2981 0.8118 0.9010
No log 24.4706 416 0.7835 0.3445 0.7835 0.8851
No log 24.5882 418 0.7540 0.4354 0.7540 0.8683
No log 24.7059 420 0.7475 0.4076 0.7475 0.8646
No log 24.8235 422 0.7535 0.3918 0.7535 0.8681
No log 24.9412 424 0.7683 0.3918 0.7683 0.8765
No log 25.0588 426 0.7558 0.3918 0.7558 0.8693
No log 25.1765 428 0.7187 0.4753 0.7187 0.8477
No log 25.2941 430 0.7111 0.5017 0.7111 0.8433
No log 25.4118 432 0.7152 0.4705 0.7152 0.8457
No log 25.5294 434 0.7339 0.4597 0.7339 0.8567
No log 25.6471 436 0.7285 0.4597 0.7285 0.8535
No log 25.7647 438 0.7302 0.4597 0.7302 0.8545
No log 25.8824 440 0.7333 0.4597 0.7333 0.8563
No log 26.0 442 0.7579 0.4330 0.7579 0.8706
No log 26.1176 444 0.7453 0.4330 0.7453 0.8633
No log 26.2353 446 0.7115 0.4845 0.7115 0.8435
No log 26.3529 448 0.6994 0.5111 0.6994 0.8363
No log 26.4706 450 0.7057 0.4845 0.7057 0.8401
No log 26.5882 452 0.7387 0.4479 0.7387 0.8595
No log 26.7059 454 0.7575 0.4052 0.7575 0.8703
No log 26.8235 456 0.7273 0.4753 0.7273 0.8528
No log 26.9412 458 0.6859 0.4592 0.6859 0.8282
No log 27.0588 460 0.6751 0.4592 0.6751 0.8216
No log 27.1765 462 0.6819 0.4845 0.6819 0.8258
No log 27.2941 464 0.7339 0.4479 0.7339 0.8567
No log 27.4118 466 0.7853 0.3891 0.7853 0.8862
No log 27.5294 468 0.7891 0.4502 0.7891 0.8883
No log 27.6471 470 0.7554 0.4392 0.7554 0.8692
No log 27.7647 472 0.6916 0.4479 0.6916 0.8316
No log 27.8824 474 0.6736 0.4479 0.6736 0.8207
No log 28.0 476 0.6701 0.4479 0.6701 0.8186
No log 28.1176 478 0.6714 0.4479 0.6714 0.8194
No log 28.2353 480 0.7054 0.4479 0.7054 0.8399
No log 28.3529 482 0.7542 0.4392 0.7542 0.8684
No log 28.4706 484 0.7983 0.4642 0.7983 0.8935
No log 28.5882 486 0.7963 0.4224 0.7963 0.8924
No log 28.7059 488 0.7573 0.4753 0.7573 0.8702
No log 28.8235 490 0.7164 0.4753 0.7164 0.8464
No log 28.9412 492 0.7228 0.4753 0.7228 0.8502
No log 29.0588 494 0.7444 0.4753 0.7444 0.8628
No log 29.1765 496 0.7462 0.4247 0.7462 0.8638
No log 29.2941 498 0.7619 0.4247 0.7619 0.8729
0.311 29.4118 500 0.7913 0.4224 0.7913 0.8895
0.311 29.5294 502 0.7920 0.4224 0.7920 0.8899
0.311 29.6471 504 0.7454 0.3972 0.7454 0.8634
0.311 29.7647 506 0.7056 0.4845 0.7056 0.8400
0.311 29.8824 508 0.7058 0.4845 0.7058 0.8401
0.311 30.0 510 0.7274 0.4845 0.7274 0.8529
0.311 30.1176 512 0.7524 0.4052 0.7524 0.8674
0.311 30.2353 514 0.8111 0.4089 0.8111 0.9006
0.311 30.3529 516 0.8516 0.3940 0.8516 0.9228
0.311 30.4706 518 0.8569 0.3940 0.8569 0.9257
0.311 30.5882 520 0.8032 0.4052 0.8032 0.8962

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k7_task7_organization

Finetuned
(4019)
this model