ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k3_task5_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.5617
  • Qwk: 0.5747
  • Mse: 0.5617
  • Rmse: 0.7495

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1176 2 3.9340 -0.0109 3.9340 1.9834
No log 0.2353 4 2.2620 -0.0007 2.2620 1.5040
No log 0.3529 6 1.5065 -0.0078 1.5065 1.2274
No log 0.4706 8 1.1305 0.2271 1.1305 1.0632
No log 0.5882 10 1.2356 0.0880 1.2356 1.1116
No log 0.7059 12 1.3547 0.0135 1.3547 1.1639
No log 0.8235 14 1.1037 0.2217 1.1037 1.0506
No log 0.9412 16 1.0227 0.3014 1.0227 1.0113
No log 1.0588 18 1.3479 0.0827 1.3479 1.1610
No log 1.1765 20 1.5253 0.1600 1.5253 1.2350
No log 1.2941 22 1.1308 0.1675 1.1308 1.0634
No log 1.4118 24 0.7941 0.3688 0.7941 0.8911
No log 1.5294 26 0.7854 0.4533 0.7854 0.8862
No log 1.6471 28 1.0868 0.3276 1.0868 1.0425
No log 1.7647 30 1.6890 0.2314 1.6890 1.2996
No log 1.8824 32 1.6728 0.2424 1.6728 1.2934
No log 2.0 34 1.3673 0.2283 1.3673 1.1693
No log 2.1176 36 1.0830 0.3059 1.0830 1.0407
No log 2.2353 38 0.7718 0.4198 0.7718 0.8785
No log 2.3529 40 0.7695 0.5567 0.7695 0.8772
No log 2.4706 42 0.8134 0.5154 0.8134 0.9019
No log 2.5882 44 0.7296 0.5375 0.7296 0.8541
No log 2.7059 46 0.7591 0.4750 0.7591 0.8713
No log 2.8235 48 0.7732 0.4750 0.7732 0.8793
No log 2.9412 50 0.7752 0.5415 0.7752 0.8805
No log 3.0588 52 0.7245 0.5182 0.7245 0.8512
No log 3.1765 54 0.7214 0.5500 0.7214 0.8494
No log 3.2941 56 0.7020 0.5703 0.7020 0.8379
No log 3.4118 58 0.7459 0.5773 0.7459 0.8637
No log 3.5294 60 0.8393 0.5310 0.8393 0.9161
No log 3.6471 62 0.8177 0.5299 0.8177 0.9043
No log 3.7647 64 0.7530 0.6087 0.7530 0.8678
No log 3.8824 66 0.8089 0.5888 0.8089 0.8994
No log 4.0 68 0.9379 0.5153 0.9379 0.9685
No log 4.1176 70 1.0206 0.5252 1.0206 1.0102
No log 4.2353 72 0.7869 0.5854 0.7869 0.8871
No log 4.3529 74 0.7509 0.5886 0.7509 0.8666
No log 4.4706 76 0.7363 0.6280 0.7363 0.8581
No log 4.5882 78 0.7195 0.6186 0.7195 0.8482
No log 4.7059 80 1.0920 0.5181 1.0920 1.0450
No log 4.8235 82 1.4364 0.3302 1.4364 1.1985
No log 4.9412 84 1.2713 0.4085 1.2713 1.1275
No log 5.0588 86 0.9124 0.4969 0.9124 0.9552
No log 5.1765 88 0.6870 0.5425 0.6870 0.8289
No log 5.2941 90 0.6706 0.5618 0.6706 0.8189
No log 5.4118 92 0.6832 0.6169 0.6832 0.8266
No log 5.5294 94 0.6876 0.6544 0.6876 0.8292
No log 5.6471 96 0.8071 0.6119 0.8071 0.8984
No log 5.7647 98 0.8984 0.5562 0.8984 0.9479
No log 5.8824 100 0.8176 0.5839 0.8176 0.9042
No log 6.0 102 0.6799 0.6479 0.6799 0.8245
No log 6.1176 104 0.6207 0.5771 0.6207 0.7878
No log 6.2353 106 0.6199 0.5783 0.6199 0.7873
No log 6.3529 108 0.6447 0.6564 0.6447 0.8030
No log 6.4706 110 0.7398 0.6427 0.7398 0.8601
No log 6.5882 112 0.9479 0.4860 0.9479 0.9736
No log 6.7059 114 0.9660 0.4860 0.9660 0.9828
No log 6.8235 116 0.7709 0.6214 0.7709 0.8780
No log 6.9412 118 0.7251 0.5958 0.7251 0.8515
No log 7.0588 120 0.7347 0.5958 0.7347 0.8571
No log 7.1765 122 0.7127 0.5646 0.7127 0.8442
No log 7.2941 124 0.8080 0.6262 0.8080 0.8989
No log 7.4118 126 0.8325 0.5653 0.8325 0.9124
No log 7.5294 128 0.7309 0.5633 0.7309 0.8549
No log 7.6471 130 0.6636 0.6157 0.6636 0.8146
No log 7.7647 132 0.6966 0.5469 0.6966 0.8346
No log 7.8824 134 0.6830 0.6083 0.6830 0.8264
No log 8.0 136 0.6795 0.6374 0.6795 0.8243
No log 8.1176 138 0.6980 0.6206 0.6980 0.8355
No log 8.2353 140 0.7020 0.5877 0.7020 0.8379
No log 8.3529 142 0.7480 0.5809 0.7480 0.8649
No log 8.4706 144 0.7431 0.6045 0.7431 0.8620
No log 8.5882 146 0.7271 0.6157 0.7271 0.8527
No log 8.7059 148 0.7203 0.5774 0.7203 0.8487
No log 8.8235 150 0.7132 0.5985 0.7132 0.8445
No log 8.9412 152 0.7300 0.6115 0.7300 0.8544
No log 9.0588 154 0.7626 0.5803 0.7626 0.8733
No log 9.1765 156 0.7619 0.5803 0.7619 0.8728
No log 9.2941 158 0.8106 0.5590 0.8106 0.9003
No log 9.4118 160 0.7906 0.5374 0.7906 0.8891
No log 9.5294 162 0.7786 0.5461 0.7786 0.8824
No log 9.6471 164 0.7920 0.5673 0.7920 0.8899
No log 9.7647 166 0.8554 0.5329 0.8554 0.9249
No log 9.8824 168 0.8108 0.5142 0.8108 0.9004
No log 10.0 170 0.7690 0.5145 0.7690 0.8769
No log 10.1176 172 0.7174 0.5230 0.7174 0.8470
No log 10.2353 174 0.6810 0.5542 0.6810 0.8253
No log 10.3529 176 0.6945 0.6358 0.6945 0.8334
No log 10.4706 178 0.7354 0.5566 0.7354 0.8575
No log 10.5882 180 0.7831 0.5479 0.7831 0.8849
No log 10.7059 182 0.8052 0.5415 0.8052 0.8973
No log 10.8235 184 0.7596 0.5530 0.7596 0.8716
No log 10.9412 186 0.7017 0.5184 0.7017 0.8377
No log 11.0588 188 0.6732 0.4912 0.6732 0.8205
No log 11.1765 190 0.6757 0.4932 0.6757 0.8220
No log 11.2941 192 0.7124 0.5108 0.7124 0.8440
No log 11.4118 194 0.7669 0.5025 0.7669 0.8758
No log 11.5294 196 0.7557 0.5257 0.7557 0.8693
No log 11.6471 198 0.6982 0.5459 0.6982 0.8356
No log 11.7647 200 0.6462 0.5828 0.6462 0.8039
No log 11.8824 202 0.6311 0.6570 0.6311 0.7944
No log 12.0 204 0.6909 0.6035 0.6909 0.8312
No log 12.1176 206 0.7469 0.5770 0.7469 0.8642
No log 12.2353 208 0.7088 0.6116 0.7088 0.8419
No log 12.3529 210 0.6528 0.6125 0.6528 0.8080
No log 12.4706 212 0.6316 0.7129 0.6316 0.7948
No log 12.5882 214 0.6279 0.7129 0.6279 0.7924
No log 12.7059 216 0.6665 0.6116 0.6665 0.8164
No log 12.8235 218 0.8851 0.5183 0.8851 0.9408
No log 12.9412 220 1.0464 0.5199 1.0464 1.0229
No log 13.0588 222 0.9909 0.5461 0.9909 0.9954
No log 13.1765 224 0.8203 0.5432 0.8203 0.9057
No log 13.2941 226 0.6607 0.6194 0.6607 0.8128
No log 13.4118 228 0.6252 0.6435 0.6252 0.7907
No log 13.5294 230 0.6383 0.6444 0.6383 0.7989
No log 13.6471 232 0.6865 0.6003 0.6865 0.8285
No log 13.7647 234 0.6732 0.6500 0.6732 0.8205
No log 13.8824 236 0.6496 0.6550 0.6496 0.8060
No log 14.0 238 0.6465 0.6518 0.6465 0.8040
No log 14.1176 240 0.6403 0.6430 0.6403 0.8002
No log 14.2353 242 0.6534 0.6269 0.6534 0.8083
No log 14.3529 244 0.7041 0.6065 0.7041 0.8391
No log 14.4706 246 0.8063 0.5759 0.8063 0.8979
No log 14.5882 248 0.7897 0.5462 0.7897 0.8887
No log 14.7059 250 0.6920 0.5334 0.6920 0.8318
No log 14.8235 252 0.6229 0.6508 0.6229 0.7892
No log 14.9412 254 0.6039 0.6729 0.6039 0.7771
No log 15.0588 256 0.5987 0.6822 0.5987 0.7738
No log 15.1765 258 0.6169 0.6452 0.6169 0.7854
No log 15.2941 260 0.7122 0.5659 0.7122 0.8439
No log 15.4118 262 0.9219 0.5181 0.9219 0.9602
No log 15.5294 264 0.9418 0.5181 0.9418 0.9705
No log 15.6471 266 0.8132 0.5365 0.8132 0.9018
No log 15.7647 268 0.6780 0.5922 0.6780 0.8234
No log 15.8824 270 0.6431 0.6491 0.6431 0.8019
No log 16.0 272 0.6456 0.6254 0.6456 0.8035
No log 16.1176 274 0.6446 0.6068 0.6446 0.8028
No log 16.2353 276 0.6599 0.5930 0.6599 0.8123
No log 16.3529 278 0.7735 0.5661 0.7735 0.8795
No log 16.4706 280 0.8113 0.5370 0.8113 0.9007
No log 16.5882 282 0.7364 0.5471 0.7364 0.8581
No log 16.7059 284 0.6450 0.5425 0.6450 0.8031
No log 16.8235 286 0.6260 0.5713 0.6260 0.7912
No log 16.9412 288 0.6150 0.5990 0.6150 0.7842
No log 17.0588 290 0.6029 0.6094 0.6029 0.7764
No log 17.1765 292 0.5903 0.6442 0.5903 0.7683
No log 17.2941 294 0.6083 0.6812 0.6083 0.7799
No log 17.4118 296 0.6629 0.6633 0.6629 0.8142
No log 17.5294 298 0.7789 0.5772 0.7789 0.8826
No log 17.6471 300 0.7843 0.5696 0.7843 0.8856
No log 17.7647 302 0.6708 0.6108 0.6708 0.8190
No log 17.8824 304 0.5941 0.6675 0.5941 0.7708
No log 18.0 306 0.5973 0.6400 0.5973 0.7728
No log 18.1176 308 0.6045 0.5977 0.6045 0.7775
No log 18.2353 310 0.6174 0.6174 0.6174 0.7857
No log 18.3529 312 0.6813 0.6253 0.6813 0.8254
No log 18.4706 314 0.8429 0.5526 0.8429 0.9181
No log 18.5882 316 0.8875 0.5522 0.8875 0.9421
No log 18.7059 318 0.7938 0.5412 0.7938 0.8909
No log 18.8235 320 0.7208 0.5435 0.7208 0.8490
No log 18.9412 322 0.6442 0.5879 0.6442 0.8026
No log 19.0588 324 0.6348 0.5485 0.6348 0.7968
No log 19.1765 326 0.6493 0.5274 0.6493 0.8058
No log 19.2941 328 0.6841 0.4723 0.6841 0.8271
No log 19.4118 330 0.7738 0.5012 0.7738 0.8797
No log 19.5294 332 0.8647 0.5075 0.8647 0.9299
No log 19.6471 334 0.8434 0.5363 0.8434 0.9184
No log 19.7647 336 0.7259 0.5223 0.7259 0.8520
No log 19.8824 338 0.6396 0.6606 0.6396 0.7997
No log 20.0 340 0.6628 0.6468 0.6628 0.8141
No log 20.1176 342 0.6979 0.6479 0.6979 0.8354
No log 20.2353 344 0.6690 0.6562 0.6690 0.8179
No log 20.3529 346 0.6261 0.6536 0.6261 0.7913
No log 20.4706 348 0.6202 0.6491 0.6202 0.7875
No log 20.5882 350 0.6497 0.5966 0.6497 0.8061
No log 20.7059 352 0.7188 0.5656 0.7188 0.8478
No log 20.8235 354 0.7431 0.6221 0.7431 0.8621
No log 20.9412 356 0.7066 0.5922 0.7066 0.8406
No log 21.0588 358 0.6606 0.6302 0.6606 0.8128
No log 21.1765 360 0.6477 0.6115 0.6477 0.8048
No log 21.2941 362 0.6421 0.6237 0.6421 0.8013
No log 21.4118 364 0.6423 0.6039 0.6423 0.8014
No log 21.5294 366 0.6325 0.6039 0.6325 0.7953
No log 21.6471 368 0.6215 0.5843 0.6215 0.7884
No log 21.7647 370 0.6255 0.5843 0.6255 0.7909
No log 21.8824 372 0.6137 0.5736 0.6137 0.7834
No log 22.0 374 0.6245 0.5971 0.6245 0.7902
No log 22.1176 376 0.6243 0.6174 0.6243 0.7902
No log 22.2353 378 0.6347 0.6234 0.6347 0.7967
No log 22.3529 380 0.6273 0.6321 0.6273 0.7920
No log 22.4706 382 0.6241 0.6661 0.6241 0.7900
No log 22.5882 384 0.6175 0.6795 0.6175 0.7858
No log 22.7059 386 0.6237 0.6561 0.6237 0.7897
No log 22.8235 388 0.6483 0.6386 0.6483 0.8052
No log 22.9412 390 0.6063 0.6184 0.6063 0.7786
No log 23.0588 392 0.5932 0.5865 0.5932 0.7702
No log 23.1765 394 0.5834 0.5830 0.5834 0.7638
No log 23.2941 396 0.5924 0.6174 0.5924 0.7697
No log 23.4118 398 0.6218 0.6265 0.6218 0.7886
No log 23.5294 400 0.6628 0.6194 0.6628 0.8141
No log 23.6471 402 0.6605 0.6080 0.6605 0.8127
No log 23.7647 404 0.6185 0.6368 0.6185 0.7865
No log 23.8824 406 0.6015 0.6042 0.6015 0.7756
No log 24.0 408 0.6012 0.6246 0.6012 0.7754
No log 24.1176 410 0.5998 0.6049 0.5998 0.7745
No log 24.2353 412 0.6112 0.6078 0.6112 0.7818
No log 24.3529 414 0.6561 0.6021 0.6561 0.8100
No log 24.4706 416 0.6765 0.6226 0.6765 0.8225
No log 24.5882 418 0.6633 0.6021 0.6633 0.8145
No log 24.7059 420 0.6345 0.5662 0.6345 0.7965
No log 24.8235 422 0.6291 0.5662 0.6291 0.7932
No log 24.9412 424 0.6438 0.6010 0.6438 0.8024
No log 25.0588 426 0.6344 0.5575 0.6344 0.7965
No log 25.1765 428 0.6283 0.5510 0.6283 0.7926
No log 25.2941 430 0.6316 0.5622 0.6316 0.7947
No log 25.4118 432 0.6334 0.5622 0.6334 0.7959
No log 25.5294 434 0.6359 0.5622 0.6359 0.7974
No log 25.6471 436 0.6277 0.5622 0.6277 0.7923
No log 25.7647 438 0.6205 0.6316 0.6205 0.7877
No log 25.8824 440 0.6302 0.6518 0.6302 0.7939
No log 26.0 442 0.6336 0.6423 0.6336 0.7960
No log 26.1176 444 0.6285 0.6195 0.6285 0.7928
No log 26.2353 446 0.6328 0.5888 0.6328 0.7955
No log 26.3529 448 0.6459 0.5887 0.6459 0.8037
No log 26.4706 450 0.6846 0.5913 0.6846 0.8274
No log 26.5882 452 0.6765 0.5117 0.6765 0.8225
No log 26.7059 454 0.6407 0.5217 0.6407 0.8005
No log 26.8235 456 0.6026 0.6078 0.6026 0.7763
No log 26.9412 458 0.5935 0.6916 0.5935 0.7704
No log 27.0588 460 0.6013 0.7006 0.6013 0.7755
No log 27.1765 462 0.6226 0.6767 0.6226 0.7890
No log 27.2941 464 0.6461 0.6610 0.6461 0.8038
No log 27.4118 466 0.6632 0.6795 0.6632 0.8143
No log 27.5294 468 0.7171 0.6283 0.7171 0.8468
No log 27.6471 470 0.7274 0.5832 0.7274 0.8529
No log 27.7647 472 0.6911 0.6163 0.6911 0.8313
No log 27.8824 474 0.6444 0.6109 0.6444 0.8027
No log 28.0 476 0.6171 0.5770 0.6171 0.7855
No log 28.1176 478 0.5981 0.6042 0.5981 0.7734
No log 28.2353 480 0.5932 0.6697 0.5932 0.7702
No log 28.3529 482 0.6041 0.6578 0.6041 0.7772
No log 28.4706 484 0.6487 0.6854 0.6487 0.8054
No log 28.5882 486 0.7140 0.6344 0.7140 0.8450
No log 28.7059 488 0.7433 0.5889 0.7433 0.8621
No log 28.8235 490 0.7446 0.5889 0.7446 0.8629
No log 28.9412 492 0.7246 0.5902 0.7246 0.8512
No log 29.0588 494 0.6735 0.6055 0.6735 0.8207
No log 29.1765 496 0.6180 0.5554 0.6180 0.7861
No log 29.2941 498 0.6061 0.6164 0.6061 0.7785
0.2679 29.4118 500 0.6206 0.6143 0.6206 0.7878
0.2679 29.5294 502 0.6240 0.5419 0.6240 0.7899
0.2679 29.6471 504 0.6146 0.5302 0.6146 0.7840
0.2679 29.7647 506 0.6025 0.5659 0.6025 0.7762
0.2679 29.8824 508 0.6139 0.5890 0.6139 0.7835
0.2679 30.0 510 0.6507 0.6173 0.6507 0.8067
0.2679 30.1176 512 0.6760 0.6055 0.6760 0.8222
0.2679 30.2353 514 0.6582 0.6045 0.6582 0.8113
0.2679 30.3529 516 0.6561 0.6299 0.6561 0.8100
0.2679 30.4706 518 0.6277 0.6779 0.6277 0.7923
0.2679 30.5882 520 0.6117 0.6973 0.6117 0.7821
0.2679 30.7059 522 0.6080 0.6804 0.6080 0.7797
0.2679 30.8235 524 0.6129 0.6779 0.6129 0.7829
0.2679 30.9412 526 0.6086 0.6779 0.6086 0.7801
0.2679 31.0588 528 0.6148 0.6945 0.6148 0.7841
0.2679 31.1765 530 0.6243 0.7040 0.6243 0.7901
0.2679 31.2941 532 0.6351 0.6985 0.6351 0.7969
0.2679 31.4118 534 0.6377 0.6377 0.6377 0.7985
0.2679 31.5294 536 0.6194 0.6377 0.6194 0.7870
0.2679 31.6471 538 0.6115 0.6891 0.6115 0.7820
0.2679 31.7647 540 0.6076 0.7210 0.6076 0.7795
0.2679 31.8824 542 0.6082 0.7210 0.6082 0.7798
0.2679 32.0 544 0.6034 0.7054 0.6034 0.7768
0.2679 32.1176 546 0.6026 0.7092 0.6026 0.7763
0.2679 32.2353 548 0.6040 0.6795 0.6040 0.7771
0.2679 32.3529 550 0.5945 0.6620 0.5945 0.7711
0.2679 32.4706 552 0.5872 0.6620 0.5872 0.7663
0.2679 32.5882 554 0.5734 0.6536 0.5734 0.7572
0.2679 32.7059 556 0.5659 0.5868 0.5659 0.7522
0.2679 32.8235 558 0.5649 0.5868 0.5649 0.7516
0.2679 32.9412 560 0.5642 0.5868 0.5642 0.7512
0.2679 33.0588 562 0.5638 0.5747 0.5638 0.7509
0.2679 33.1765 564 0.5617 0.5747 0.5617 0.7495

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k3_task5_organization

Finetuned
(4033)
this model