ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k1_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset. It achieves the following results on the evaluation set:

  • Loss: 0.5897
  • QWK: 0.4872
  • MSE: 0.5897
  • RMSE: 0.7680
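The card does not include a usage example. Below is a minimal inference sketch with 🤗 Transformers, assuming the checkpoint carries the standard sequence-classification head with a single regression output (the MSE/RMSE metrics above suggest a regression objective); the max_length of 512 is an assumption, not stated in the card:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "MayBashendy/ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k1_task7_organization"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

text = "..."  # an Arabic essay to score for organization
inputs = tokenizer(text, truncation=True, max_length=512, return_tensors="pt")
with torch.no_grad():
    # A single-logit head read as a real-valued score (assumed regression setup).
    score = model(**inputs).logits.squeeze().item()
print(score)
```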

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a TrainingArguments sketch reproducing them follows the list):

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
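These settings map directly onto 🤗 TrainingArguments. In the sketch below, output_dir is hypothetical, and the steps-based evaluation schedule is inferred from the results table (which logs validation every 2 steps), not stated in the card:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="arabert_task7_organization",  # hypothetical path
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=100,
    adam_beta1=0.9,        # Adam with betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    eval_strategy="steps",  # inferred: the table evaluates every 2 steps
    eval_steps=2,
)
```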

Training results

Training Loss | Epoch | Step | Validation Loss | QWK | MSE | RMSE
No log 0.3333 2 2.3858 -0.0336 2.3858 1.5446
No log 0.6667 4 0.9878 0.2164 0.9878 0.9939
No log 1.0 6 0.7505 0.1232 0.7505 0.8663
No log 1.3333 8 0.8644 0.3347 0.8644 0.9297
No log 1.6667 10 0.9520 0.3280 0.9520 0.9757
No log 2.0 12 0.9115 0.3006 0.9115 0.9547
No log 2.3333 14 1.1386 0.2713 1.1386 1.0670
No log 2.6667 16 0.7439 0.3193 0.7439 0.8625
No log 3.0 18 0.6465 0.4880 0.6465 0.8040
No log 3.3333 20 0.9000 0.3538 0.9000 0.9487
No log 3.6667 22 1.0296 0.3431 1.0296 1.0147
No log 4.0 24 0.7479 0.4222 0.7479 0.8648
No log 4.3333 26 0.5889 0.4044 0.5889 0.7674
No log 4.6667 28 0.6086 0.4427 0.6086 0.7801
No log 5.0 30 0.7250 0.4625 0.7250 0.8515
No log 5.3333 32 0.8539 0.4799 0.8539 0.9241
No log 5.6667 34 0.7584 0.4717 0.7584 0.8709
No log 6.0 36 0.6380 0.4189 0.6380 0.7987
No log 6.3333 38 0.6781 0.4886 0.6781 0.8234
No log 6.6667 40 0.6345 0.4363 0.6345 0.7966
No log 7.0 42 0.8322 0.5504 0.8322 0.9122
No log 7.3333 44 1.0975 0.4149 1.0975 1.0476
No log 7.6667 46 1.1115 0.3298 1.1115 1.0543
No log 8.0 48 0.9627 0.4204 0.9627 0.9812
No log 8.3333 50 0.8154 0.4351 0.8154 0.9030
No log 8.6667 52 0.7552 0.3689 0.7552 0.8690
No log 9.0 54 0.7046 0.3723 0.7046 0.8394
No log 9.3333 56 0.6493 0.4581 0.6493 0.8058
No log 9.6667 58 0.6495 0.4581 0.6495 0.8059
No log 10.0 60 0.7277 0.4155 0.7277 0.8531
No log 10.3333 62 0.7177 0.3958 0.7177 0.8472
No log 10.6667 64 0.8110 0.3402 0.8110 0.9005
No log 11.0 66 0.8415 0.3151 0.8415 0.9174
No log 11.3333 68 0.9310 0.3596 0.9310 0.9649
No log 11.6667 70 1.0779 0.4265 1.0779 1.0382
No log 12.0 72 0.9399 0.4305 0.9399 0.9695
No log 12.3333 74 0.7617 0.4768 0.7617 0.8728
No log 12.6667 76 0.7107 0.4574 0.7107 0.8430
No log 13.0 78 0.6673 0.4574 0.6673 0.8169
No log 13.3333 80 0.7325 0.5059 0.7325 0.8558
No log 13.6667 82 0.6459 0.4678 0.6459 0.8036
No log 14.0 84 0.5605 0.6046 0.5605 0.7487
No log 14.3333 86 0.5434 0.5533 0.5434 0.7371
No log 14.6667 88 0.5817 0.4901 0.5817 0.7627
No log 15.0 90 0.6288 0.3820 0.6288 0.7930
No log 15.3333 92 0.7144 0.4265 0.7144 0.8452
No log 15.6667 94 0.7181 0.4199 0.7181 0.8474
No log 16.0 96 0.5989 0.4625 0.5989 0.7739
No log 16.3333 98 0.5510 0.5015 0.5510 0.7423
No log 16.6667 100 0.5925 0.4854 0.5925 0.7697
No log 17.0 102 0.7862 0.4304 0.7862 0.8867
No log 17.3333 104 0.7899 0.4912 0.7899 0.8888
No log 17.6667 106 0.7486 0.4925 0.7486 0.8652
No log 18.0 108 0.6242 0.5409 0.6242 0.7900
No log 18.3333 110 0.5581 0.5945 0.5581 0.7470
No log 18.6667 112 0.5094 0.6492 0.5094 0.7138
No log 19.0 114 0.5313 0.5692 0.5313 0.7289
No log 19.3333 116 0.5125 0.5692 0.5125 0.7159
No log 19.6667 118 0.4899 0.6184 0.4899 0.6999
No log 20.0 120 0.6819 0.4764 0.6819 0.8258
No log 20.3333 122 0.8378 0.4166 0.8378 0.9153
No log 20.6667 124 0.8252 0.4365 0.8252 0.9084
No log 21.0 126 0.6519 0.3820 0.6519 0.8074
No log 21.3333 128 0.5450 0.4660 0.5450 0.7382
No log 21.6667 130 0.5388 0.4660 0.5388 0.7340
No log 22.0 132 0.5388 0.5159 0.5388 0.7340
No log 22.3333 134 0.5728 0.5115 0.5728 0.7568
No log 22.6667 136 0.5990 0.4928 0.5990 0.7739
No log 23.0 138 0.6608 0.4879 0.6608 0.8129
No log 23.3333 140 0.7560 0.5283 0.7560 0.8695
No log 23.6667 142 0.7004 0.5283 0.7004 0.8369
No log 24.0 144 0.5755 0.4795 0.5755 0.7586
No log 24.3333 146 0.4945 0.5323 0.4945 0.7032
No log 24.6667 148 0.4874 0.6073 0.4874 0.6982
No log 25.0 150 0.5122 0.6073 0.5122 0.7157
No log 25.3333 152 0.6076 0.4471 0.6076 0.7795
No log 25.6667 154 0.7033 0.4520 0.7033 0.8386
No log 26.0 156 0.7932 0.4637 0.7932 0.8906
No log 26.3333 158 0.7693 0.4260 0.7693 0.8771
No log 26.6667 160 0.6549 0.4539 0.6549 0.8093
No log 27.0 162 0.5983 0.5016 0.5983 0.7735
No log 27.3333 164 0.5997 0.5016 0.5997 0.7744
No log 27.6667 166 0.6370 0.5059 0.6370 0.7981
No log 28.0 168 0.6405 0.4862 0.6405 0.8003
No log 28.3333 170 0.5880 0.5133 0.5880 0.7668
No log 28.6667 172 0.6117 0.4460 0.6117 0.7821
No log 29.0 174 0.6066 0.4390 0.6066 0.7789
No log 29.3333 176 0.5589 0.4901 0.5589 0.7476
No log 29.6667 178 0.5783 0.4662 0.5783 0.7604
No log 30.0 180 0.6372 0.5231 0.6372 0.7983
No log 30.3333 182 0.6629 0.5076 0.6629 0.8142
No log 30.6667 184 0.6384 0.5140 0.6384 0.7990
No log 31.0 186 0.6134 0.4860 0.6134 0.7832
No log 31.3333 188 0.5394 0.5288 0.5394 0.7344
No log 31.6667 190 0.5298 0.5888 0.5298 0.7279
No log 32.0 192 0.5579 0.5273 0.5579 0.7469
No log 32.3333 194 0.5748 0.4624 0.5748 0.7582
No log 32.6667 196 0.5365 0.4777 0.5365 0.7325
No log 33.0 198 0.5157 0.5177 0.5157 0.7181
No log 33.3333 200 0.5241 0.5232 0.5241 0.7239
No log 33.6667 202 0.5569 0.4966 0.5569 0.7462
No log 34.0 204 0.6413 0.4700 0.6413 0.8008
No log 34.3333 206 0.6706 0.4700 0.6706 0.8189
No log 34.6667 208 0.6401 0.4550 0.6401 0.8000
No log 35.0 210 0.6075 0.4622 0.6075 0.7794
No log 35.3333 212 0.5634 0.4726 0.5634 0.7506
No log 35.6667 214 0.5566 0.5151 0.5566 0.7461
No log 36.0 216 0.5553 0.4984 0.5553 0.7452
No log 36.3333 218 0.5653 0.4576 0.5653 0.7519
No log 36.6667 220 0.5903 0.4719 0.5903 0.7683
No log 37.0 222 0.5763 0.4795 0.5763 0.7591
No log 37.3333 224 0.6110 0.4925 0.6110 0.7817
No log 37.6667 226 0.5670 0.5030 0.5670 0.7530
No log 38.0 228 0.5089 0.5488 0.5089 0.7134
No log 38.3333 230 0.4920 0.5505 0.4920 0.7014
No log 38.6667 232 0.5013 0.5488 0.5013 0.7080
No log 39.0 234 0.5532 0.4970 0.5532 0.7438
No log 39.3333 236 0.5957 0.4681 0.5957 0.7718
No log 39.6667 238 0.6539 0.5332 0.6539 0.8086
No log 40.0 240 0.6616 0.5294 0.6616 0.8134
No log 40.3333 242 0.6239 0.5484 0.6239 0.7899
No log 40.6667 244 0.5772 0.4681 0.5772 0.7597
No log 41.0 246 0.5306 0.5109 0.5306 0.7284
No log 41.3333 248 0.5215 0.4838 0.5215 0.7221
No log 41.6667 250 0.4928 0.5815 0.4928 0.7020
No log 42.0 252 0.4819 0.6039 0.4819 0.6942
No log 42.3333 254 0.4858 0.6254 0.4858 0.6970
No log 42.6667 256 0.5043 0.5626 0.5043 0.7102
No log 43.0 258 0.5424 0.5276 0.5424 0.7365
No log 43.3333 260 0.5554 0.5459 0.5554 0.7452
No log 43.6667 262 0.5864 0.4837 0.5864 0.7657
No log 44.0 264 0.6466 0.4888 0.6466 0.8041
No log 44.3333 266 0.6646 0.4821 0.6646 0.8152
No log 44.6667 268 0.7126 0.4684 0.7126 0.8441
No log 45.0 270 0.6749 0.4539 0.6749 0.8215
No log 45.3333 272 0.6403 0.4610 0.6403 0.8002
No log 45.6667 274 0.5578 0.4582 0.5578 0.7469
No log 46.0 276 0.5236 0.5517 0.5236 0.7236
No log 46.3333 278 0.5077 0.6129 0.5077 0.7125
No log 46.6667 280 0.4994 0.6046 0.4994 0.7067
No log 47.0 282 0.5097 0.5951 0.5097 0.7139
No log 47.3333 284 0.4922 0.6060 0.4922 0.7016
No log 47.6667 286 0.4766 0.6255 0.4766 0.6903
No log 48.0 288 0.4789 0.6357 0.4789 0.6920
No log 48.3333 290 0.4785 0.6357 0.4785 0.6918
No log 48.6667 292 0.4900 0.5815 0.4900 0.7000
No log 49.0 294 0.5442 0.4883 0.5442 0.7377
No log 49.3333 296 0.6226 0.4845 0.6226 0.7890
No log 49.6667 298 0.6740 0.5023 0.6740 0.8210
No log 50.0 300 0.6982 0.5007 0.6982 0.8356
No log 50.3333 302 0.6563 0.5076 0.6563 0.8101
No log 50.6667 304 0.6016 0.5056 0.6016 0.7757
No log 51.0 306 0.5953 0.4862 0.5953 0.7716
No log 51.3333 308 0.6069 0.4862 0.6069 0.7791
No log 51.6667 310 0.6111 0.4872 0.6111 0.7817
No log 52.0 312 0.6110 0.4644 0.6110 0.7816
No log 52.3333 314 0.6559 0.4335 0.6559 0.8098
No log 52.6667 316 0.7216 0.4404 0.7216 0.8495
No log 53.0 318 0.7832 0.4748 0.7832 0.8850
No log 53.3333 320 0.7955 0.4748 0.7955 0.8919
No log 53.6667 322 0.7391 0.4568 0.7391 0.8597
No log 54.0 324 0.6777 0.4404 0.6777 0.8232
No log 54.3333 326 0.6365 0.4408 0.6365 0.7978
No log 54.6667 328 0.5955 0.4889 0.5955 0.7717
No log 55.0 330 0.5745 0.4681 0.5745 0.7580
No log 55.3333 332 0.5802 0.4889 0.5802 0.7617
No log 55.6667 334 0.6123 0.4408 0.6123 0.7825
No log 56.0 336 0.6771 0.3976 0.6771 0.8228
No log 56.3333 338 0.7210 0.3948 0.7210 0.8491
No log 56.6667 340 0.7092 0.4012 0.7092 0.8421
No log 57.0 342 0.6807 0.3976 0.6807 0.8250
No log 57.3333 344 0.6262 0.4114 0.6262 0.7913
No log 57.6667 346 0.6052 0.4354 0.6052 0.7780
No log 58.0 348 0.6076 0.4354 0.6076 0.7795
No log 58.3333 350 0.6444 0.4335 0.6444 0.8028
No log 58.6667 352 0.6820 0.4335 0.6820 0.8258
No log 59.0 354 0.6752 0.4263 0.6752 0.8217
No log 59.3333 356 0.6965 0.4334 0.6965 0.8346
No log 59.6667 358 0.7191 0.4334 0.7191 0.8480
No log 60.0 360 0.7036 0.4334 0.7036 0.8388
No log 60.3333 362 0.6516 0.4904 0.6516 0.8072
No log 60.6667 364 0.6153 0.4721 0.6153 0.7844
No log 61.0 366 0.5825 0.4740 0.5825 0.7632
No log 61.3333 368 0.5415 0.4543 0.5415 0.7359
No log 61.6667 370 0.5285 0.4719 0.5285 0.7270
No log 62.0 372 0.5295 0.4640 0.5295 0.7276
No log 62.3333 374 0.5425 0.4601 0.5425 0.7366
No log 62.6667 376 0.5748 0.4601 0.5748 0.7581
No log 63.0 378 0.6210 0.4281 0.6210 0.7880
No log 63.3333 380 0.6477 0.4051 0.6477 0.8048
No log 63.6667 382 0.6347 0.4207 0.6347 0.7967
No log 64.0 384 0.6074 0.4281 0.6074 0.7794
No log 64.3333 386 0.6004 0.4281 0.6004 0.7749
No log 64.6667 388 0.5997 0.4281 0.5997 0.7744
No log 65.0 390 0.6165 0.4281 0.6165 0.7852
No log 65.3333 392 0.6517 0.4428 0.6517 0.8073
No log 65.6667 394 0.6810 0.4648 0.6810 0.8252
No log 66.0 396 0.6850 0.4648 0.6850 0.8277
No log 66.3333 398 0.6832 0.4648 0.6832 0.8266
No log 66.6667 400 0.6526 0.4354 0.6526 0.8078
No log 67.0 402 0.6131 0.4207 0.6131 0.7830
No log 67.3333 404 0.5939 0.4523 0.5939 0.7707
No log 67.6667 406 0.5868 0.4740 0.5868 0.7660
No log 68.0 408 0.6004 0.4740 0.6004 0.7748
No log 68.3333 410 0.6009 0.4740 0.6009 0.7752
No log 68.6667 412 0.6122 0.4937 0.6122 0.7825
No log 69.0 414 0.6146 0.4937 0.6146 0.7840
No log 69.3333 416 0.6144 0.4872 0.6144 0.7838
No log 69.6667 418 0.6039 0.4872 0.6039 0.7771
No log 70.0 420 0.5738 0.4951 0.5738 0.7575
No log 70.3333 422 0.5635 0.4889 0.5635 0.7506
No log 70.6667 424 0.5531 0.4758 0.5531 0.7437
No log 71.0 426 0.5573 0.4681 0.5573 0.7466
No log 71.3333 428 0.5685 0.4681 0.5685 0.7540
No log 71.6667 430 0.5670 0.4681 0.5670 0.7530
No log 72.0 432 0.5558 0.5444 0.5558 0.7455
No log 72.3333 434 0.5544 0.5272 0.5544 0.7445
No log 72.6667 436 0.5584 0.4907 0.5584 0.7473
No log 73.0 438 0.5574 0.4907 0.5574 0.7466
No log 73.3333 440 0.5480 0.4907 0.5480 0.7403
No log 73.6667 442 0.5516 0.4681 0.5516 0.7427
No log 74.0 444 0.5621 0.4889 0.5621 0.7498
No log 74.3333 446 0.5767 0.4951 0.5767 0.7594
No log 74.6667 448 0.5845 0.4872 0.5845 0.7645
No log 75.0 450 0.6039 0.4872 0.6039 0.7771
No log 75.3333 452 0.6375 0.4937 0.6375 0.7984
No log 75.6667 454 0.6625 0.4997 0.6625 0.8140
No log 76.0 456 0.6687 0.4997 0.6687 0.8177
No log 76.3333 458 0.6557 0.4997 0.6557 0.8098
No log 76.6667 460 0.6271 0.4937 0.6271 0.7919
No log 77.0 462 0.5946 0.4889 0.5946 0.7711
No log 77.3333 464 0.5773 0.4758 0.5773 0.7598
No log 77.6667 466 0.5740 0.4758 0.5740 0.7577
No log 78.0 468 0.5729 0.4758 0.5729 0.7569
No log 78.3333 470 0.5760 0.4624 0.5760 0.7589
No log 78.6667 472 0.5710 0.4699 0.5710 0.7556
No log 79.0 474 0.5750 0.4624 0.5750 0.7583
No log 79.3333 476 0.5781 0.4624 0.5781 0.7603
No log 79.6667 478 0.5788 0.4624 0.5788 0.7608
No log 80.0 480 0.5730 0.4624 0.5730 0.7570
No log 80.3333 482 0.5623 0.4699 0.5623 0.7499
No log 80.6667 484 0.5566 0.4699 0.5566 0.7461
No log 81.0 486 0.5517 0.4856 0.5517 0.7428
No log 81.3333 488 0.5535 0.4486 0.5535 0.7440
No log 81.6667 490 0.5580 0.4699 0.5580 0.7470
No log 82.0 492 0.5639 0.4681 0.5639 0.7509
No log 82.3333 494 0.5748 0.4740 0.5748 0.7581
No log 82.6667 496 0.5928 0.4872 0.5928 0.7699
No log 83.0 498 0.6054 0.4872 0.6054 0.7781
0.1819 83.3333 500 0.6120 0.4872 0.6120 0.7823
0.1819 83.6667 502 0.6178 0.4872 0.6178 0.7860
0.1819 84.0 504 0.6219 0.4872 0.6219 0.7886
0.1819 84.3333 506 0.6149 0.4872 0.6149 0.7841
0.1819 84.6667 508 0.5995 0.4872 0.5995 0.7743
0.1819 85.0 510 0.5897 0.4872 0.5897 0.7680

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1