ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k3_task3_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified (the auto-generated card records it as None). It achieves the following results on the evaluation set:

  • Loss: 0.8278
  • Qwk: -0.1094
  • Mse: 0.8278
  • Rmse: 0.9098
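
The card does not define these metrics. The sketch below assumes that Qwk is quadratic weighted kappa (Cohen's kappa with quadratic weights), that Mse and Rmse are mean squared error and its square root, and that continuous predictions are rounded and clipped to integer ratings before Qwk is computed. The function names and the rounding step are illustrative, not taken from the training code. Loss and Mse are identical in every reported row, which is consistent with an MSE training objective, and sqrt(0.8278) ≈ 0.9098 matches the reported Rmse.

```python
import math
from collections import Counter


def to_ratings(preds, min_rating, max_rating):
    """Round continuous predictions and clip them to the rating range (assumed step)."""
    return [min(max(int(round(p)), min_rating), max_rating) for p in preds]


def quadratic_weighted_kappa(y_true, y_pred, min_rating, max_rating):
    """Cohen's kappa with quadratic disagreement weights for integer ratings."""
    n_labels = max_rating - min_rating + 1
    if n_labels < 2:
        raise ValueError("need at least two distinct rating levels")
    n = len(y_true)
    observed = Counter(zip(y_true, y_pred))
    true_hist = Counter(y_true)
    pred_hist = Counter(y_pred)

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(min_rating, max_rating + 1):
        for j in range(min_rating, max_rating + 1):
            weight = (i - j) ** 2 / (n_labels - 1) ** 2
            # Expected count under independence of the two raters.
            expected = true_hist[i] * pred_hist[j] / n
            weighted_observed += weight * observed[(i, j)]
            weighted_expected += weight * expected
    if weighted_expected == 0:
        # Kappa is undefined when both raters use a single label; return 0.0 by convention.
        return 0.0
    return 1.0 - weighted_observed / weighted_expected


def mse(y_true, y_pred):
    """Mean squared error between targets and predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(mse(y_true, y_pred))
```

Under this definition a Qwk near zero means agreement no better than chance, and a negative value such as the reported -0.1094 means agreement slightly worse than chance.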

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
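
To make the linear scheduler concrete, here is a minimal sketch of how the learning rate would decay under these settings. It makes two assumptions that the card does not state: no warmup steps, and 800 total optimizer steps (the training results table shows epoch 1.0 at step 8, so 100 epochs would be 8 × 100 steps). The `HPARAMS` dictionary and `linear_lr` function are illustrative names, not part of the original training script.

```python
# Hyperparameters as listed in the card.
HPARAMS = {
    "learning_rate": 2e-05,
    "train_batch_size": 8,
    "eval_batch_size": 8,
    "seed": 42,
    "adam_betas": (0.9, 0.999),
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_epochs": 100,
}

# Assumption: 8 optimizer steps per epoch, read off the training results table.
STEPS_PER_EPOCH = 8
TOTAL_STEPS = STEPS_PER_EPOCH * HPARAMS["num_epochs"]


def linear_lr(step, total_steps, base_lr, warmup_steps=0):
    """Linear warmup (optional) followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 200, 510, 800):
        lr = linear_lr(step, TOTAL_STEPS, HPARAMS["learning_rate"])
        print(f"step {step:3d}: lr = {lr:.3e}")
```

Under these assumptions the learning rate at step 510, the last step in the log, would be 2e-05 × 290 / 800 = 7.25e-06.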

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.25 2 3.6274 -0.0047 3.6274 1.9046
No log 0.5 4 2.0265 0.0265 2.0265 1.4235
No log 0.75 6 1.4598 -0.0014 1.4598 1.2082
No log 1.0 8 0.8293 0.0017 0.8293 0.9107
No log 1.25 10 0.8043 0.0099 0.8043 0.8968
No log 1.5 12 0.9092 0.0067 0.9092 0.9535
No log 1.75 14 0.9999 0.0217 0.9999 1.0000
No log 2.0 16 0.9134 -0.0163 0.9134 0.9557
No log 2.25 18 0.8730 0.0748 0.8730 0.9344
No log 2.5 20 0.8095 -0.0351 0.8095 0.8997
No log 2.75 22 0.8928 -0.0842 0.8928 0.9449
No log 3.0 24 1.0318 -0.0133 1.0318 1.0158
No log 3.25 26 0.9121 -0.1665 0.9121 0.9550
No log 3.5 28 0.9068 -0.1263 0.9068 0.9523
No log 3.75 30 0.7923 0.0260 0.7923 0.8901
No log 4.0 32 0.7809 -0.1067 0.7809 0.8837
No log 4.25 34 0.7730 -0.1067 0.7730 0.8792
No log 4.5 36 0.8966 -0.1259 0.8966 0.9469
No log 4.75 38 1.1123 -0.0331 1.1123 1.0547
No log 5.0 40 1.0604 -0.0236 1.0604 1.0298
No log 5.25 42 0.8747 -0.0778 0.8747 0.9353
No log 5.5 44 0.8332 -0.0578 0.8332 0.9128
No log 5.75 46 0.8581 -0.0930 0.8581 0.9264
No log 6.0 48 0.9580 -0.0669 0.9580 0.9788
No log 6.25 50 0.9817 -0.1447 0.9817 0.9908
No log 6.5 52 0.9185 0.0347 0.9185 0.9584
No log 6.75 54 0.9247 0.0656 0.9247 0.9616
No log 7.0 56 0.9201 -0.0138 0.9201 0.9592
No log 7.25 58 0.9291 0.1079 0.9291 0.9639
No log 7.5 60 0.9083 -0.1520 0.9083 0.9531
No log 7.75 62 0.9174 -0.1015 0.9174 0.9578
No log 8.0 64 0.9556 -0.1200 0.9556 0.9776
No log 8.25 66 0.9443 -0.0646 0.9443 0.9718
No log 8.5 68 0.9557 -0.0036 0.9557 0.9776
No log 8.75 70 1.0770 -0.0604 1.0770 1.0378
No log 9.0 72 1.0896 -0.0927 1.0896 1.0439
No log 9.25 74 1.0134 -0.0682 1.0134 1.0067
No log 9.5 76 0.9787 -0.0484 0.9787 0.9893
No log 9.75 78 1.0527 -0.0436 1.0527 1.0260
No log 10.0 80 0.9680 -0.0801 0.9680 0.9839
No log 10.25 82 0.9118 -0.0831 0.9118 0.9549
No log 10.5 84 0.8928 -0.1394 0.8928 0.9449
No log 10.75 86 0.8355 0.0061 0.8355 0.9141
No log 11.0 88 0.8365 -0.0849 0.8365 0.9146
No log 11.25 90 0.8556 0.0585 0.8556 0.9250
No log 11.5 92 0.8892 -0.0334 0.8892 0.9430
No log 11.75 94 0.9484 -0.0995 0.9484 0.9739
No log 12.0 96 0.9600 -0.0261 0.9600 0.9798
No log 12.25 98 1.0311 0.0820 1.0311 1.0154
No log 12.5 100 1.0269 0.0816 1.0269 1.0133
No log 12.75 102 0.9303 0.1115 0.9303 0.9645
No log 13.0 104 0.9176 -0.0939 0.9176 0.9579
No log 13.25 106 0.9382 -0.0283 0.9382 0.9686
No log 13.5 108 0.8240 0.0395 0.8240 0.9077
No log 13.75 110 0.8202 -0.0939 0.8202 0.9057
No log 14.0 112 0.8342 -0.0738 0.8342 0.9133
No log 14.25 114 0.7890 -0.0520 0.7890 0.8883
No log 14.5 116 0.7607 -0.0520 0.7607 0.8722
No log 14.75 118 0.7493 -0.0520 0.7493 0.8656
No log 15.0 120 0.7601 -0.0520 0.7601 0.8719
No log 15.25 122 0.7904 -0.0449 0.7904 0.8890
No log 15.5 124 0.8241 0.0633 0.8241 0.9078
No log 15.75 126 0.8553 0.0714 0.8553 0.9248
No log 16.0 128 0.8762 0.1078 0.8762 0.9360
No log 16.25 130 0.8612 0.1115 0.8612 0.9280
No log 16.5 132 0.8202 0.0240 0.8202 0.9057
No log 16.75 134 0.7622 0.0089 0.7622 0.8731
No log 17.0 136 0.7711 0.0148 0.7711 0.8781
No log 17.25 138 0.7662 0.0 0.7662 0.8753
No log 17.5 140 0.7809 0.0 0.7809 0.8837
No log 17.75 142 0.7846 0.0 0.7846 0.8858
No log 18.0 144 0.8382 0.0662 0.8382 0.9155
No log 18.25 146 0.8752 -0.0551 0.8752 0.9355
No log 18.5 148 0.8785 -0.0121 0.8785 0.9373
No log 18.75 150 0.8410 -0.0271 0.8410 0.9171
No log 19.0 152 0.8634 -0.0132 0.8634 0.9292
No log 19.25 154 0.8838 -0.0898 0.8838 0.9401
No log 19.5 156 0.9054 0.0682 0.9054 0.9515
No log 19.75 158 0.9242 0.0420 0.9242 0.9613
No log 20.0 160 0.8770 0.0763 0.8770 0.9365
No log 20.25 162 0.8742 -0.0181 0.8742 0.9350
No log 20.5 164 0.8577 -0.0307 0.8577 0.9261
No log 20.75 166 0.8509 -0.0113 0.8509 0.9225
No log 21.0 168 0.8595 -0.0113 0.8595 0.9271
No log 21.25 170 0.8981 0.0249 0.8981 0.9477
No log 21.5 172 0.9339 0.1155 0.9339 0.9664
No log 21.75 174 0.8723 -0.0150 0.8723 0.9340
No log 22.0 176 0.8330 -0.0633 0.8330 0.9127
No log 22.25 178 0.8515 0.0181 0.8515 0.9227
No log 22.5 180 0.8352 -0.0506 0.8352 0.9139
No log 22.75 182 0.8598 0.0225 0.8598 0.9272
No log 23.0 184 0.8327 -0.0152 0.8327 0.9125
No log 23.25 186 0.8589 0.0297 0.8589 0.9268
No log 23.5 188 0.9164 0.0392 0.9164 0.9573
No log 23.75 190 0.8940 -0.0016 0.8940 0.9455
No log 24.0 192 0.8080 -0.0826 0.8080 0.8989
No log 24.25 194 0.7802 -0.0473 0.7802 0.8833
No log 24.5 196 0.7966 0.0 0.7966 0.8925
No log 24.75 198 0.8271 -0.0406 0.8271 0.9094
No log 25.0 200 0.8831 0.1123 0.8831 0.9397
No log 25.25 202 0.9096 0.1525 0.9096 0.9537
No log 25.5 204 0.8619 0.2038 0.8619 0.9284
No log 25.75 206 0.8134 -0.0595 0.8134 0.9019
No log 26.0 208 0.8075 -0.0628 0.8075 0.8986
No log 26.25 210 0.7910 -0.0595 0.7910 0.8894
No log 26.5 212 0.8153 0.1133 0.8153 0.9029
No log 26.75 214 0.8673 0.1128 0.8673 0.9313
No log 27.0 216 0.8500 0.1034 0.8500 0.9219
No log 27.25 218 0.8368 -0.0350 0.8368 0.9148
No log 27.5 220 0.8158 -0.0427 0.8158 0.9032
No log 27.75 222 0.7850 -0.0541 0.7850 0.8860
No log 28.0 224 0.7822 -0.0541 0.7822 0.8844
No log 28.25 226 0.7875 -0.0541 0.7875 0.8874
No log 28.5 228 0.8087 0.0598 0.8087 0.8993
No log 28.75 230 0.8221 0.0652 0.8221 0.9067
No log 29.0 232 0.8071 0.0148 0.8071 0.8984
No log 29.25 234 0.7654 -0.1094 0.7654 0.8749
No log 29.5 236 0.7610 -0.0188 0.7610 0.8723
No log 29.75 238 0.7519 -0.0160 0.7519 0.8671
No log 30.0 240 0.7562 -0.0541 0.7562 0.8696
No log 30.25 242 0.8002 0.0031 0.8002 0.8945
No log 30.5 244 0.8327 0.1080 0.8327 0.9125
No log 30.75 246 0.8411 0.1080 0.8411 0.9171
No log 31.0 248 0.8275 0.1080 0.8275 0.9097
No log 31.25 250 0.8072 0.0229 0.8072 0.8984
No log 31.5 252 0.7963 0.0229 0.7963 0.8923
No log 31.75 254 0.7782 0.0031 0.7782 0.8821
No log 32.0 256 0.7778 -0.0541 0.7778 0.8819
No log 32.25 258 0.7769 -0.0541 0.7769 0.8814
No log 32.5 260 0.7928 0.0571 0.7928 0.8904
No log 32.75 262 0.8030 0.0585 0.8030 0.8961
No log 33.0 264 0.8059 0.0061 0.8059 0.8977
No log 33.25 266 0.8344 -0.1033 0.8344 0.9135
No log 33.5 268 0.8654 0.0165 0.8654 0.9303
No log 33.75 270 0.9025 0.1590 0.9025 0.9500
No log 34.0 272 0.8795 0.1171 0.8795 0.9378
No log 34.25 274 0.8147 0.0155 0.8147 0.9026
No log 34.5 276 0.7813 -0.1074 0.7813 0.8839
No log 34.75 278 0.7744 -0.0062 0.7744 0.8800
No log 35.0 280 0.8259 -0.0030 0.8259 0.9088
No log 35.25 282 0.8972 0.0570 0.8972 0.9472
No log 35.5 284 0.9641 -0.0393 0.9641 0.9819
No log 35.75 286 0.9531 0.0089 0.9531 0.9763
No log 36.0 288 0.8530 -0.0237 0.8530 0.9236
No log 36.25 290 0.7496 -0.0541 0.7496 0.8658
No log 36.5 292 0.7245 0.0479 0.7245 0.8512
No log 36.75 294 0.7858 -0.0188 0.7858 0.8864
No log 37.0 296 0.8333 -0.0675 0.8333 0.9128
No log 37.25 298 0.8493 -0.0675 0.8493 0.9216
No log 37.5 300 0.8520 0.0081 0.8520 0.9230
No log 37.75 302 0.9027 0.0377 0.9027 0.9501
No log 38.0 304 0.9506 0.1152 0.9506 0.9750
No log 38.25 306 0.9406 0.0800 0.9406 0.9698
No log 38.5 308 0.8885 0.0328 0.8885 0.9426
No log 38.75 310 0.8579 0.0058 0.8579 0.9262
No log 39.0 312 0.8997 -0.0658 0.8997 0.9485
No log 39.25 314 0.8878 -0.1116 0.8878 0.9423
No log 39.5 316 0.8498 -0.0999 0.8498 0.9218
No log 39.75 318 0.8339 0.0571 0.8339 0.9132
No log 40.0 320 0.8561 0.0089 0.8561 0.9253
No log 40.25 322 0.8552 0.0030 0.8552 0.9248
No log 40.5 324 0.8574 -0.1457 0.8574 0.9260
No log 40.75 326 0.8874 -0.0718 0.8874 0.9420
No log 41.0 328 0.8895 -0.0731 0.8895 0.9431
No log 41.25 330 0.8843 -0.0704 0.8843 0.9404
No log 41.5 332 0.8710 -0.0889 0.8710 0.9333
No log 41.75 334 0.8902 -0.0705 0.8902 0.9435
No log 42.0 336 0.9185 0.0702 0.9185 0.9584
No log 42.25 338 0.8792 0.0683 0.8792 0.9377
No log 42.5 340 0.8116 -0.1010 0.8116 0.9009
No log 42.75 342 0.8011 -0.0032 0.8011 0.8951
No log 43.0 344 0.8034 0.0 0.8034 0.8964
No log 43.25 346 0.8141 -0.0513 0.8141 0.9023
No log 43.5 348 0.8108 -0.0513 0.8108 0.9005
No log 43.75 350 0.8316 -0.0513 0.8316 0.9119
No log 44.0 352 0.8325 -0.0513 0.8325 0.9124
No log 44.25 354 0.8106 -0.0513 0.8106 0.9004
No log 44.5 356 0.7858 0.0 0.7858 0.8864
No log 44.75 358 0.7702 -0.1067 0.7702 0.8776
No log 45.0 360 0.7669 -0.1067 0.7669 0.8757
No log 45.25 362 0.8131 -0.1001 0.8131 0.9017
No log 45.5 364 0.8415 -0.0881 0.8415 0.9173
No log 45.75 366 0.8468 -0.0427 0.8468 0.9202
No log 46.0 368 0.8424 -0.0550 0.8424 0.9178
No log 46.25 370 0.8458 -0.0113 0.8458 0.9197
No log 46.5 372 0.8459 -0.0427 0.8459 0.9197
No log 46.75 374 0.8451 -0.0892 0.8451 0.9193
No log 47.0 376 0.8146 -0.1074 0.8146 0.9026
No log 47.25 378 0.7828 -0.0541 0.7828 0.8848
No log 47.5 380 0.7902 0.0436 0.7902 0.8889
No log 47.75 382 0.7985 -0.0240 0.7985 0.8936
No log 48.0 384 0.7910 -0.0690 0.7910 0.8894
No log 48.25 386 0.7964 -0.0091 0.7964 0.8924
No log 48.5 388 0.8318 -0.1074 0.8318 0.9121
No log 48.75 390 0.8657 -0.0307 0.8657 0.9304
No log 49.0 392 0.8863 0.0251 0.8863 0.9414
No log 49.25 394 0.9006 0.0328 0.9006 0.9490
No log 49.5 396 0.8827 -0.0173 0.8827 0.9395
No log 49.75 398 0.8356 0.0 0.8356 0.9141
No log 50.0 400 0.7957 -0.0032 0.7957 0.8920
No log 50.25 402 0.7959 0.0395 0.7959 0.8922
No log 50.5 404 0.7976 0.0436 0.7976 0.8931
No log 50.75 406 0.7913 -0.0032 0.7913 0.8895
No log 51.0 408 0.7950 -0.1067 0.7950 0.8916
No log 51.25 410 0.7911 -0.1067 0.7911 0.8894
No log 51.5 412 0.7740 -0.1067 0.7740 0.8798
No log 51.75 414 0.7779 -0.0032 0.7779 0.8820
No log 52.0 416 0.8096 -0.1100 0.8096 0.8998
No log 52.25 418 0.8230 -0.0675 0.8230 0.9072
No log 52.5 420 0.8243 -0.1100 0.8243 0.9079
No log 52.75 422 0.8093 -0.0541 0.8093 0.8996
No log 53.0 424 0.8161 -0.1067 0.8161 0.9034
No log 53.25 426 0.8300 -0.1067 0.8300 0.9110
No log 53.5 428 0.8366 -0.1067 0.8366 0.9147
No log 53.75 430 0.8397 -0.0449 0.8397 0.9163
No log 54.0 432 0.8516 -0.1871 0.8516 0.9228
No log 54.25 434 0.8735 -0.1832 0.8735 0.9346
No log 54.5 436 0.8907 -0.2190 0.8907 0.9438
No log 54.75 438 0.8846 -0.1638 0.8846 0.9405
No log 55.0 440 0.8632 -0.1040 0.8632 0.9291
No log 55.25 442 0.8358 -0.1094 0.8358 0.9142
No log 55.5 444 0.8228 -0.1094 0.8228 0.9071
No log 55.75 446 0.8004 -0.1094 0.8004 0.8946
No log 56.0 448 0.8028 -0.1094 0.8028 0.8960
No log 56.25 450 0.8022 -0.1111 0.8022 0.8957
No log 56.5 452 0.7954 -0.1094 0.7954 0.8918
No log 56.75 454 0.8105 -0.1094 0.8105 0.9003
No log 57.0 456 0.8338 -0.1538 0.8338 0.9131
No log 57.25 458 0.8561 -0.1268 0.8561 0.9252
No log 57.5 460 0.8527 -0.1653 0.8527 0.9234
No log 57.75 462 0.8411 -0.1951 0.8411 0.9171
No log 58.0 464 0.8344 -0.1951 0.8344 0.9135
No log 58.25 466 0.8405 -0.1208 0.8405 0.9168
No log 58.5 468 0.8502 -0.1151 0.8502 0.9221
No log 58.75 470 0.8479 -0.1208 0.8479 0.9208
No log 59.0 472 0.8439 -0.1813 0.8439 0.9187
No log 59.25 474 0.8288 -0.0958 0.8288 0.9104
No log 59.5 476 0.8239 -0.1033 0.8239 0.9077
No log 59.75 478 0.8152 -0.1094 0.8152 0.9029
No log 60.0 480 0.8079 -0.1094 0.8079 0.8988
No log 60.25 482 0.8089 -0.1094 0.8089 0.8994
No log 60.5 484 0.7964 -0.1094 0.7964 0.8924
No log 60.75 486 0.7769 -0.0032 0.7769 0.8814
No log 61.0 488 0.7718 -0.0032 0.7718 0.8785
No log 61.25 490 0.7848 -0.0032 0.7848 0.8859
No log 61.5 492 0.7924 -0.0032 0.7924 0.8902
No log 61.75 494 0.8101 -0.0032 0.8101 0.9001
No log 62.0 496 0.8260 -0.0032 0.8260 0.9089
No log 62.25 498 0.8349 -0.1094 0.8349 0.9137
0.1829 62.5 500 0.8380 -0.1094 0.8380 0.9154
0.1829 62.75 502 0.8344 -0.1094 0.8344 0.9134
0.1829 63.0 504 0.8328 -0.1094 0.8328 0.9126
0.1829 63.25 506 0.8333 -0.1094 0.8333 0.9128
0.1829 63.5 508 0.8319 -0.1094 0.8319 0.9121
0.1829 63.75 510 0.8278 -0.1094 0.8278 0.9098
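
The log ends at step 510 (epoch 63.75) although num_epochs is 100, and the headline results above correspond to that last row. "No log" in the first column means no training loss had been logged yet; the first value appears at step 500, which would be consistent with a 500-step logging interval (an assumption, as the card does not state it). The sketch below shows one way to parse rows in this whitespace-separated layout and compare checkpoints. It uses a four-row excerpt copied from the table, and the `EvalRow` type and `parse_row` helper are illustrative names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EvalRow:
    train_loss: Optional[float]  # None where the table says "No log"
    epoch: float
    step: int
    val_loss: float
    qwk: float
    mse: float
    rmse: float


def parse_row(line: str) -> EvalRow:
    """Parse one whitespace-separated row of the training results table."""
    tokens = line.split()
    if tokens[:2] == ["No", "log"]:
        train_loss, rest = None, tokens[2:]
    else:
        train_loss, rest = float(tokens[0]), tokens[1:]
    epoch, step, val_loss, qwk, mse, rmse = rest
    return EvalRow(train_loss, float(epoch), int(step),
                   float(val_loss), float(qwk), float(mse), float(rmse))


# Four rows copied verbatim from the table above.
LOG_EXCERPT = """\
No log 25.5 204 0.8619 0.2038 0.8619 0.9284
No log 36.25 290 0.7496 -0.0541 0.7496 0.8658
No log 36.5 292 0.7245 0.0479 0.7245 0.8512
0.1829 63.75 510 0.8278 -0.1094 0.8278 0.9098
"""

rows = [parse_row(line) for line in LOG_EXCERPT.splitlines() if line.strip()]
best_by_loss = min(rows, key=lambda r: r.val_loss)
best_by_qwk = max(rows, key=lambda r: r.qwk)

if __name__ == "__main__":
    print("lowest validation loss:", best_by_loss.step, best_by_loss.val_loss)
    print("highest Qwk:", best_by_qwk.step, best_by_qwk.qwk)
```

Across the full table, the lowest validation loss is 0.7245 at step 292 and the highest Qwk is 0.2038 at step 204, so the final checkpoint at step 510 is not the best row by either measure.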

Framework versions

  • Transformers 4.44.2
  • PyTorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size: 0.1B params (Safetensors), tensor type F32

Model repository: MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k3_task3_organization