ArabicNewSplits7_OSS_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k1_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8245
  • Qwk: 0.4595
  • Mse: 0.8245
  • Rmse: 0.9080

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.3333 2 4.4398 0.0163 4.4398 2.1071
No log 0.6667 4 2.6688 -0.0040 2.6688 1.6336
No log 1.0 6 1.6064 0.0372 1.6064 1.2674
No log 1.3333 8 1.2444 0.0600 1.2444 1.1155
No log 1.6667 10 1.1681 0.1689 1.1681 1.0808
No log 2.0 12 1.1435 0.2457 1.1435 1.0693
No log 2.3333 14 1.1825 0.2636 1.1825 1.0874
No log 2.6667 16 1.1909 0.1565 1.1909 1.0913
No log 3.0 18 1.1008 0.2684 1.1008 1.0492
No log 3.3333 20 1.0194 0.3365 1.0194 1.0096
No log 3.6667 22 0.9784 0.3643 0.9784 0.9891
No log 4.0 24 0.9343 0.3979 0.9343 0.9666
No log 4.3333 26 1.1654 0.2899 1.1654 1.0795
No log 4.6667 28 0.9480 0.4518 0.9480 0.9736
No log 5.0 30 1.0280 0.4088 1.0280 1.0139
No log 5.3333 32 1.1235 0.3481 1.1235 1.0600
No log 5.6667 34 1.1787 0.3481 1.1787 1.0857
No log 6.0 36 1.0717 0.5396 1.0717 1.0352
No log 6.3333 38 1.3762 0.3477 1.3762 1.1731
No log 6.6667 40 1.3696 0.3170 1.3696 1.1703
No log 7.0 42 1.1405 0.4615 1.1405 1.0679
No log 7.3333 44 1.1017 0.4420 1.1017 1.0496
No log 7.6667 46 1.2061 0.2768 1.2061 1.0982
No log 8.0 48 0.8982 0.4885 0.8982 0.9477
No log 8.3333 50 0.8507 0.4741 0.8507 0.9224
No log 8.6667 52 1.0050 0.4570 1.0050 1.0025
No log 9.0 54 0.8438 0.4815 0.8438 0.9186
No log 9.3333 56 0.8233 0.5011 0.8233 0.9073
No log 9.6667 58 0.8505 0.5011 0.8505 0.9222
No log 10.0 60 0.9525 0.4489 0.9525 0.9760
No log 10.3333 62 0.9720 0.4301 0.9720 0.9859
No log 10.6667 64 0.9382 0.4714 0.9382 0.9686
No log 11.0 66 0.9761 0.4467 0.9761 0.9880
No log 11.3333 68 0.9804 0.4620 0.9804 0.9901
No log 11.6667 70 1.1064 0.3978 1.1064 1.0518
No log 12.0 72 1.2772 0.3180 1.2772 1.1301
No log 12.3333 74 1.0863 0.4266 1.0863 1.0422
No log 12.6667 76 0.8631 0.5396 0.8631 0.9290
No log 13.0 78 0.9101 0.4957 0.9101 0.9540
No log 13.3333 80 0.8469 0.5082 0.8469 0.9203
No log 13.6667 82 0.9729 0.4347 0.9729 0.9864
No log 14.0 84 1.1297 0.4057 1.1297 1.0629
No log 14.3333 86 0.9825 0.4746 0.9825 0.9912
No log 14.6667 88 0.9291 0.5220 0.9291 0.9639
No log 15.0 90 0.9998 0.4314 0.9998 0.9999
No log 15.3333 92 0.9120 0.4939 0.9120 0.9550
No log 15.6667 94 0.8889 0.5534 0.8889 0.9428
No log 16.0 96 0.9678 0.5128 0.9678 0.9838
No log 16.3333 98 0.9135 0.5830 0.9135 0.9558
No log 16.6667 100 0.8889 0.4853 0.8889 0.9428
No log 17.0 102 1.1794 0.4339 1.1794 1.0860
No log 17.3333 104 1.2622 0.4478 1.2622 1.1235
No log 17.6667 106 1.0378 0.4557 1.0378 1.0187
No log 18.0 108 0.8709 0.5843 0.8709 0.9332
No log 18.3333 110 0.9244 0.5367 0.9244 0.9615
No log 18.6667 112 0.9024 0.5668 0.9024 0.9499
No log 19.0 114 0.9022 0.4858 0.9022 0.9498
No log 19.3333 116 1.0290 0.4444 1.0290 1.0144
No log 19.6667 118 1.0667 0.4609 1.0667 1.0328
No log 20.0 120 0.9583 0.4772 0.9583 0.9789
No log 20.3333 122 0.8816 0.5305 0.8816 0.9390
No log 20.6667 124 0.8331 0.5315 0.8331 0.9127
No log 21.0 126 0.8447 0.5420 0.8447 0.9191
No log 21.3333 128 0.8319 0.5673 0.8319 0.9121
No log 21.6667 130 0.8862 0.5324 0.8862 0.9414
No log 22.0 132 0.9089 0.5266 0.9089 0.9534
No log 22.3333 134 0.9189 0.5448 0.9189 0.9586
No log 22.6667 136 0.8343 0.5645 0.8343 0.9134
No log 23.0 138 0.8889 0.5785 0.8889 0.9428
No log 23.3333 140 0.9232 0.5642 0.9232 0.9608
No log 23.6667 142 0.8127 0.5382 0.8127 0.9015
No log 24.0 144 0.8171 0.5295 0.8171 0.9039
No log 24.3333 146 0.8229 0.5102 0.8229 0.9072
No log 24.6667 148 0.8250 0.5435 0.8250 0.9083
No log 25.0 150 0.8484 0.5120 0.8484 0.9211
No log 25.3333 152 0.8307 0.5343 0.8307 0.9114
No log 25.6667 154 0.8723 0.5041 0.8723 0.9340
No log 26.0 156 0.9405 0.5227 0.9405 0.9698
No log 26.3333 158 0.9157 0.5332 0.9157 0.9569
No log 26.6667 160 0.8659 0.5704 0.8659 0.9306
No log 27.0 162 0.8983 0.5889 0.8983 0.9478
No log 27.3333 164 0.8750 0.5669 0.8750 0.9354
No log 27.6667 166 0.8514 0.5559 0.8514 0.9227
No log 28.0 168 0.8428 0.4920 0.8428 0.9180
No log 28.3333 170 0.8734 0.5070 0.8734 0.9346
No log 28.6667 172 0.8252 0.4937 0.8252 0.9084
No log 29.0 174 0.8025 0.5364 0.8025 0.8958
No log 29.3333 176 0.8041 0.5387 0.8041 0.8967
No log 29.6667 178 0.8085 0.5043 0.8085 0.8992
No log 30.0 180 0.8109 0.4808 0.8109 0.9005
No log 30.3333 182 0.8258 0.4808 0.8258 0.9087
No log 30.6667 184 0.8129 0.4808 0.8129 0.9016
No log 31.0 186 0.8122 0.4948 0.8122 0.9012
No log 31.3333 188 0.8244 0.4808 0.8244 0.9080
No log 31.6667 190 0.8453 0.5211 0.8453 0.9194
No log 32.0 192 0.8882 0.5451 0.8882 0.9425
No log 32.3333 194 0.8426 0.5125 0.8426 0.9179
No log 32.6667 196 0.8287 0.5743 0.8287 0.9103
No log 33.0 198 0.8271 0.5186 0.8271 0.9095
No log 33.3333 200 0.8577 0.4796 0.8577 0.9261
No log 33.6667 202 0.8666 0.4587 0.8666 0.9309
No log 34.0 204 0.8559 0.4620 0.8559 0.9252
No log 34.3333 206 0.8579 0.5086 0.8579 0.9262
No log 34.6667 208 0.8762 0.5241 0.8762 0.9361
No log 35.0 210 0.8942 0.5232 0.8942 0.9456
No log 35.3333 212 0.9170 0.4996 0.9170 0.9576
No log 35.6667 214 0.9297 0.5246 0.9297 0.9642
No log 36.0 216 0.9332 0.5026 0.9332 0.9660
No log 36.3333 218 0.9238 0.4805 0.9238 0.9611
No log 36.6667 220 0.9224 0.4785 0.9224 0.9604
No log 37.0 222 0.9193 0.4859 0.9193 0.9588
No log 37.3333 224 0.9303 0.5349 0.9303 0.9645
No log 37.6667 226 0.9275 0.5336 0.9275 0.9631
No log 38.0 228 0.9096 0.4980 0.9096 0.9537
No log 38.3333 230 0.8895 0.4852 0.8895 0.9431
No log 38.6667 232 0.8790 0.4499 0.8790 0.9376
No log 39.0 234 0.8964 0.5724 0.8964 0.9468
No log 39.3333 236 0.9354 0.5226 0.9354 0.9672
No log 39.6667 238 0.8821 0.5137 0.8821 0.9392
No log 40.0 240 0.8819 0.5196 0.8819 0.9391
No log 40.3333 242 0.8778 0.5196 0.8778 0.9369
No log 40.6667 244 0.8909 0.4604 0.8909 0.9439
No log 41.0 246 0.9204 0.4328 0.9204 0.9594
No log 41.3333 248 0.9614 0.4076 0.9614 0.9805
No log 41.6667 250 0.9696 0.4235 0.9696 0.9847
No log 42.0 252 0.9496 0.5254 0.9496 0.9745
No log 42.3333 254 0.9497 0.5151 0.9497 0.9745
No log 42.6667 256 0.9655 0.5047 0.9655 0.9826
No log 43.0 258 0.9892 0.5273 0.9892 0.9946
No log 43.3333 260 1.0024 0.4935 1.0024 1.0012
No log 43.6667 262 0.9471 0.4985 0.9471 0.9732
No log 44.0 264 0.9016 0.5041 0.9016 0.9495
No log 44.3333 266 0.8780 0.5290 0.8780 0.9370
No log 44.6667 268 0.8938 0.5476 0.8938 0.9454
No log 45.0 270 0.8987 0.5524 0.8987 0.9480
No log 45.3333 272 0.8722 0.5953 0.8722 0.9339
No log 45.6667 274 0.8862 0.5012 0.8862 0.9414
No log 46.0 276 0.9834 0.5105 0.9834 0.9917
No log 46.3333 278 1.0246 0.4685 1.0246 1.0122
No log 46.6667 280 0.9730 0.5326 0.9730 0.9864
No log 47.0 282 0.8955 0.4920 0.8955 0.9463
No log 47.3333 284 0.8698 0.5843 0.8698 0.9326
No log 47.6667 286 0.8836 0.5495 0.8836 0.9400
No log 48.0 288 0.8652 0.5783 0.8652 0.9302
No log 48.3333 290 0.8348 0.5287 0.8348 0.9137
No log 48.6667 292 0.8729 0.5519 0.8729 0.9343
No log 49.0 294 0.9264 0.5426 0.9264 0.9625
No log 49.3333 296 0.9291 0.5301 0.9291 0.9639
No log 49.6667 298 0.8932 0.5451 0.8932 0.9451
No log 50.0 300 0.8532 0.5458 0.8532 0.9237
No log 50.3333 302 0.8521 0.5658 0.8521 0.9231
No log 50.6667 304 0.8579 0.5658 0.8579 0.9262
No log 51.0 306 0.8705 0.5458 0.8705 0.9330
No log 51.3333 308 0.9012 0.5519 0.9012 0.9493
No log 51.6667 310 0.9201 0.5414 0.9201 0.9592
No log 52.0 312 0.9193 0.5519 0.9193 0.9588
No log 52.3333 314 0.8772 0.5194 0.8772 0.9366
No log 52.6667 316 0.8396 0.5487 0.8396 0.9163
No log 53.0 318 0.8226 0.5621 0.8226 0.9069
No log 53.3333 320 0.8119 0.5753 0.8119 0.9010
No log 53.6667 322 0.8042 0.5137 0.8042 0.8968
No log 54.0 324 0.7995 0.5011 0.7995 0.8942
No log 54.3333 326 0.8007 0.4810 0.8007 0.8948
No log 54.6667 328 0.8034 0.4933 0.8034 0.8963
No log 55.0 330 0.8156 0.4695 0.8156 0.9031
No log 55.3333 332 0.8257 0.4671 0.8257 0.9087
No log 55.6667 334 0.8429 0.4724 0.8429 0.9181
No log 56.0 336 0.8518 0.4724 0.8518 0.9229
No log 56.3333 338 0.8509 0.4724 0.8509 0.9224
No log 56.6667 340 0.8527 0.4724 0.8527 0.9234
No log 57.0 342 0.8574 0.4934 0.8574 0.9259
No log 57.3333 344 0.8495 0.5059 0.8495 0.9217
No log 57.6667 346 0.8513 0.4934 0.8513 0.9227
No log 58.0 348 0.8488 0.4934 0.8488 0.9213
No log 58.3333 350 0.8617 0.4920 0.8617 0.9283
No log 58.6667 352 0.8712 0.4920 0.8712 0.9334
No log 59.0 354 0.8771 0.4796 0.8771 0.9365
No log 59.3333 356 0.8690 0.4808 0.8690 0.9322
No log 59.6667 358 0.8484 0.4934 0.8484 0.9211
No log 60.0 360 0.8257 0.4724 0.8257 0.9087
No log 60.3333 362 0.8043 0.4724 0.8043 0.8968
No log 60.6667 364 0.7858 0.4505 0.7858 0.8865
No log 61.0 366 0.7683 0.5011 0.7683 0.8765
No log 61.3333 368 0.7637 0.5040 0.7637 0.8739
No log 61.6667 370 0.7618 0.5268 0.7618 0.8728
No log 62.0 372 0.7605 0.5268 0.7605 0.8721
No log 62.3333 374 0.7679 0.5621 0.7679 0.8763
No log 62.6667 376 0.7754 0.5098 0.7754 0.8806
No log 63.0 378 0.7892 0.4951 0.7892 0.8884
No log 63.3333 380 0.8025 0.4951 0.8025 0.8958
No log 63.6667 382 0.7911 0.4746 0.7911 0.8895
No log 64.0 384 0.7935 0.4512 0.7935 0.8908
No log 64.3333 386 0.7974 0.4512 0.7974 0.8930
No log 64.6667 388 0.7961 0.4512 0.7961 0.8923
No log 65.0 390 0.7998 0.4512 0.7998 0.8943
No log 65.3333 392 0.8049 0.4512 0.8049 0.8971
No log 65.6667 394 0.8113 0.4512 0.8113 0.9007
No log 66.0 396 0.8070 0.4540 0.8070 0.8984
No log 66.3333 398 0.8050 0.4540 0.8050 0.8972
No log 66.6667 400 0.8017 0.4746 0.8017 0.8954
No log 67.0 402 0.8045 0.4996 0.8045 0.8970
No log 67.3333 404 0.8247 0.5148 0.8247 0.9081
No log 67.6667 406 0.8685 0.5673 0.8685 0.9319
No log 68.0 408 0.9066 0.5394 0.9066 0.9521
No log 68.3333 410 0.9116 0.5458 0.9116 0.9548
No log 68.6667 412 0.9286 0.5455 0.9286 0.9636
No log 69.0 414 0.9304 0.5370 0.9304 0.9646
No log 69.3333 416 0.9045 0.4869 0.9045 0.9511
No log 69.6667 418 0.8743 0.4685 0.8743 0.9350
No log 70.0 420 0.8597 0.4934 0.8597 0.9272
No log 70.3333 422 0.8551 0.4934 0.8551 0.9247
No log 70.6667 424 0.8482 0.4934 0.8482 0.9210
No log 71.0 426 0.8358 0.4934 0.8358 0.9142
No log 71.3333 428 0.8257 0.4746 0.8257 0.9087
No log 71.6667 430 0.8197 0.4746 0.8197 0.9054
No log 72.0 432 0.8176 0.4746 0.8176 0.9042
No log 72.3333 434 0.8178 0.4934 0.8178 0.9043
No log 72.6667 436 0.8169 0.4934 0.8169 0.9038
No log 73.0 438 0.8123 0.4724 0.8123 0.9012
No log 73.3333 440 0.8054 0.4724 0.8054 0.8974
No log 73.6667 442 0.8034 0.4505 0.8034 0.8963
No log 74.0 444 0.8066 0.4373 0.8066 0.8981
No log 74.3333 446 0.8115 0.4595 0.8115 0.9009
No log 74.6667 448 0.8244 0.4595 0.8244 0.9080
No log 75.0 450 0.8425 0.4459 0.8425 0.9179
No log 75.3333 452 0.8356 0.4459 0.8356 0.9141
No log 75.6667 454 0.8302 0.4808 0.8302 0.9111
No log 76.0 456 0.8216 0.4808 0.8216 0.9064
No log 76.3333 458 0.8125 0.4595 0.8125 0.9014
No log 76.6667 460 0.8061 0.4840 0.8061 0.8978
No log 77.0 462 0.8037 0.4840 0.8037 0.8965
No log 77.3333 464 0.8031 0.4859 0.8031 0.8962
No log 77.6667 466 0.8067 0.5057 0.8067 0.8982
No log 78.0 468 0.8063 0.5057 0.8063 0.8979
No log 78.3333 470 0.8026 0.5057 0.8026 0.8959
No log 78.6667 472 0.7966 0.5176 0.7966 0.8925
No log 79.0 474 0.7911 0.4981 0.7911 0.8894
No log 79.3333 476 0.7904 0.5102 0.7904 0.8890
No log 79.6667 478 0.7959 0.5176 0.7959 0.8921
No log 80.0 480 0.8039 0.4951 0.8039 0.8966
No log 80.3333 482 0.8130 0.4808 0.8130 0.9017
No log 80.6667 484 0.8304 0.5230 0.8304 0.9112
No log 81.0 486 0.8418 0.5401 0.8418 0.9175
No log 81.3333 488 0.8513 0.5401 0.8513 0.9226
No log 81.6667 490 0.8541 0.5401 0.8541 0.9242
No log 82.0 492 0.8471 0.5401 0.8471 0.9204
No log 82.3333 494 0.8311 0.4808 0.8311 0.9117
No log 82.6667 496 0.8192 0.4840 0.8192 0.9051
No log 83.0 498 0.8149 0.4965 0.8149 0.9027
0.2013 83.3333 500 0.8101 0.4981 0.8101 0.9000
0.2013 83.6667 502 0.8098 0.4981 0.8098 0.8999
0.2013 84.0 504 0.8102 0.5186 0.8102 0.9001
0.2013 84.3333 506 0.8150 0.5186 0.8150 0.9028
0.2013 84.6667 508 0.8256 0.5148 0.8256 0.9086
0.2013 85.0 510 0.8410 0.5610 0.8410 0.9170
0.2013 85.3333 512 0.8509 0.5401 0.8509 0.9224
0.2013 85.6667 514 0.8498 0.5401 0.8498 0.9218
0.2013 86.0 516 0.8409 0.5401 0.8409 0.9170
0.2013 86.3333 518 0.8277 0.5013 0.8277 0.9098
0.2013 86.6667 520 0.8171 0.4808 0.8171 0.9039
0.2013 87.0 522 0.8123 0.4724 0.8123 0.9013
0.2013 87.3333 524 0.8118 0.4724 0.8118 0.9010
0.2013 87.6667 526 0.8166 0.4595 0.8166 0.9037
0.2013 88.0 528 0.8245 0.4595 0.8245 0.9080

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
1
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_OSS_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k1_task2_organization

Finetuned
(4023)
this model