ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k2_task5_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset (the card does not name it). It achieves the following results on the evaluation set:

  • Loss: 0.7457
  • Qwk (quadratic weighted kappa): 0.5372
  • Mse: 0.7457
  • Rmse: 0.8636
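
Because Loss and Mse are identical at every logged evaluation step, the model appears to have been trained with a mean-squared-error objective, i.e. as a regression-style scorer for essay organization. Below is a minimal inference sketch under that assumption; the checkpoint ID is taken from this card, while the single-output regression head and the 512-token truncation are assumptions, not stated facts.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Checkpoint ID from this card. The regression-head assumption is inferred
# from the MSE/QWK metrics; the card does not state the head type.
model_id = "MayBashendy/ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k2_task5_organization"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

essay = "..."  # an Arabic essay to score for organization
inputs = tokenizer(essay, truncation=True, max_length=512, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # shape (1, num_labels)

# Single-output (regression) head: read the raw score;
# otherwise fall back to the most likely class.
score = logits.item() if logits.numel() == 1 else logits.argmax(-1).item()
print(score)
```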

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a reproduction sketch follows the list):

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
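
A sketch of how the hyperparameters above map onto transformers TrainingArguments. The evaluation and logging cadence are not reported; eval_steps=2 and logging_steps=500 are read off the results table below (validation every 2 steps, training loss first logged at step 500), and the output path is hypothetical. Note also that although num_epochs is 100, the log ends at epoch 56.67, so training apparently stopped early.

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="arabert_task5_organization",  # hypothetical path
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=100,
    eval_strategy="steps",
    eval_steps=2,       # assumption: the table evaluates every 2 steps
    logging_steps=500,  # assumption: "No log" until step 500 matches this default
)
```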

Training results

| Training Loss | Epoch | Step | Validation Loss | Qwk | Mse | Rmse |
|:-:|:-:|:-:|:-:|:-:|:-:|:-:|
| No log | 0.2222 | 2 | 3.8336 | -0.0332 | 3.8336 | 1.9580 |
| No log | 0.4444 | 4 | 2.1577 | 0.0816 | 2.1577 | 1.4689 |
| No log | 0.6667 | 6 | 1.3138 | 0.1142 | 1.3138 | 1.1462 |
| No log | 0.8889 | 8 | 1.0630 | 0.2391 | 1.0630 | 1.0310 |
| No log | 1.1111 | 10 | 1.0292 | 0.3048 | 1.0292 | 1.0145 |
| No log | 1.3333 | 12 | 1.0459 | 0.1954 | 1.0459 | 1.0227 |
| No log | 1.5556 | 14 | 1.1124 | 0.1657 | 1.1124 | 1.0547 |
| No log | 1.7778 | 16 | 1.2891 | 0.0256 | 1.2891 | 1.1354 |
| No log | 2.0 | 18 | 1.2912 | -0.0296 | 1.2912 | 1.1363 |
| No log | 2.2222 | 20 | 1.1895 | 0.0627 | 1.1895 | 1.0906 |
| No log | 2.4444 | 22 | 1.3410 | 0.0399 | 1.3410 | 1.1580 |
| No log | 2.6667 | 24 | 1.3003 | 0.0769 | 1.3003 | 1.1403 |
| No log | 2.8889 | 26 | 1.0863 | 0.0967 | 1.0863 | 1.0423 |
| No log | 3.1111 | 28 | 0.9979 | 0.2841 | 0.9979 | 0.9989 |
| No log | 3.3333 | 30 | 0.9876 | 0.1981 | 0.9876 | 0.9938 |
| No log | 3.5556 | 32 | 0.9706 | 0.2149 | 0.9706 | 0.9852 |
| No log | 3.7778 | 34 | 0.9023 | 0.3498 | 0.9023 | 0.9499 |
| No log | 4.0 | 36 | 0.9478 | 0.3645 | 0.9478 | 0.9736 |
| No log | 4.2222 | 38 | 0.9862 | 0.2958 | 0.9862 | 0.9931 |
| No log | 4.4444 | 40 | 0.9697 | 0.2958 | 0.9697 | 0.9847 |
| No log | 4.6667 | 42 | 0.8208 | 0.4256 | 0.8208 | 0.9060 |
| No log | 4.8889 | 44 | 0.7774 | 0.4384 | 0.7774 | 0.8817 |
| No log | 5.1111 | 46 | 0.7648 | 0.4951 | 0.7648 | 0.8745 |
| No log | 5.3333 | 48 | 0.7551 | 0.5436 | 0.7551 | 0.8690 |
| No log | 5.5556 | 50 | 0.7526 | 0.4893 | 0.7526 | 0.8675 |
| No log | 5.7778 | 52 | 0.7465 | 0.5247 | 0.7465 | 0.8640 |
| No log | 6.0 | 54 | 0.7699 | 0.5438 | 0.7699 | 0.8774 |
| No log | 6.2222 | 56 | 0.8654 | 0.4796 | 0.8654 | 0.9303 |
| No log | 6.4444 | 58 | 0.8348 | 0.5188 | 0.8348 | 0.9137 |
| No log | 6.6667 | 60 | 0.8063 | 0.4304 | 0.8063 | 0.8979 |
| No log | 6.8889 | 62 | 0.8675 | 0.4467 | 0.8675 | 0.9314 |
| No log | 7.1111 | 64 | 0.7943 | 0.4321 | 0.7943 | 0.8912 |
| No log | 7.3333 | 66 | 0.8179 | 0.5243 | 0.8179 | 0.9044 |
| No log | 7.5556 | 68 | 0.9066 | 0.4898 | 0.9066 | 0.9522 |
| No log | 7.7778 | 70 | 0.8224 | 0.4907 | 0.8224 | 0.9068 |
| No log | 8.0 | 72 | 0.7027 | 0.5626 | 0.7027 | 0.8383 |
| No log | 8.2222 | 74 | 0.6780 | 0.6451 | 0.6780 | 0.8234 |
| No log | 8.4444 | 76 | 0.7229 | 0.5410 | 0.7229 | 0.8502 |
| No log | 8.6667 | 78 | 0.7739 | 0.5591 | 0.7739 | 0.8797 |
| No log | 8.8889 | 80 | 0.7733 | 0.5902 | 0.7733 | 0.8794 |
| No log | 9.1111 | 82 | 0.7911 | 0.5611 | 0.7911 | 0.8894 |
| No log | 9.3333 | 84 | 0.7158 | 0.5618 | 0.7158 | 0.8461 |
| No log | 9.5556 | 86 | 0.6532 | 0.6104 | 0.6532 | 0.8082 |
| No log | 9.7778 | 88 | 0.6452 | 0.6354 | 0.6452 | 0.8033 |
| No log | 10.0 | 90 | 0.7161 | 0.5380 | 0.7161 | 0.8462 |
| No log | 10.2222 | 92 | 0.7980 | 0.5331 | 0.7980 | 0.8933 |
| No log | 10.4444 | 94 | 0.6949 | 0.5763 | 0.6949 | 0.8336 |
| No log | 10.6667 | 96 | 0.6511 | 0.6569 | 0.6511 | 0.8069 |
| No log | 10.8889 | 98 | 0.6477 | 0.6390 | 0.6477 | 0.8048 |
| No log | 11.1111 | 100 | 0.6748 | 0.5330 | 0.6748 | 0.8215 |
| No log | 11.3333 | 102 | 0.8293 | 0.5428 | 0.8293 | 0.9107 |
| No log | 11.5556 | 104 | 0.8306 | 0.5867 | 0.8306 | 0.9114 |
| No log | 11.7778 | 106 | 0.7187 | 0.5192 | 0.7187 | 0.8478 |
| No log | 12.0 | 108 | 0.7163 | 0.6022 | 0.7163 | 0.8463 |
| No log | 12.2222 | 110 | 0.7169 | 0.6022 | 0.7169 | 0.8467 |
| No log | 12.4444 | 112 | 0.6915 | 0.5557 | 0.6915 | 0.8316 |
| No log | 12.6667 | 114 | 0.7423 | 0.5912 | 0.7423 | 0.8615 |
| No log | 12.8889 | 116 | 0.7890 | 0.5610 | 0.7890 | 0.8882 |
| No log | 13.1111 | 118 | 0.7416 | 0.5992 | 0.7416 | 0.8612 |
| No log | 13.3333 | 120 | 0.6555 | 0.6241 | 0.6555 | 0.8096 |
| No log | 13.5556 | 122 | 0.6300 | 0.6209 | 0.6300 | 0.7937 |
| No log | 13.7778 | 124 | 0.6367 | 0.6209 | 0.6367 | 0.7979 |
| No log | 14.0 | 126 | 0.6317 | 0.6361 | 0.6317 | 0.7948 |
| No log | 14.2222 | 128 | 0.5940 | 0.5944 | 0.5940 | 0.7707 |
| No log | 14.4444 | 130 | 0.6005 | 0.6091 | 0.6005 | 0.7749 |
| No log | 14.6667 | 132 | 0.5889 | 0.5919 | 0.5889 | 0.7674 |
| No log | 14.8889 | 134 | 0.6198 | 0.6282 | 0.6198 | 0.7872 |
| No log | 15.1111 | 136 | 0.6139 | 0.6209 | 0.6139 | 0.7835 |
| No log | 15.3333 | 138 | 0.5827 | 0.5972 | 0.5827 | 0.7633 |
| No log | 15.5556 | 140 | 0.5741 | 0.6175 | 0.5741 | 0.7577 |
| No log | 15.7778 | 142 | 0.6153 | 0.6272 | 0.6153 | 0.7844 |
| No log | 16.0 | 144 | 0.6469 | 0.6209 | 0.6469 | 0.8043 |
| No log | 16.2222 | 146 | 0.6112 | 0.6429 | 0.6112 | 0.7818 |
| No log | 16.4444 | 148 | 0.5612 | 0.6822 | 0.5612 | 0.7492 |
| No log | 16.6667 | 150 | 0.5608 | 0.6866 | 0.5608 | 0.7489 |
| No log | 16.8889 | 152 | 0.5761 | 0.6584 | 0.5761 | 0.7590 |
| No log | 17.1111 | 154 | 0.7275 | 0.5745 | 0.7275 | 0.8529 |
| No log | 17.3333 | 156 | 0.7716 | 0.5230 | 0.7716 | 0.8784 |
| No log | 17.5556 | 158 | 0.6853 | 0.6305 | 0.6853 | 0.8278 |
| No log | 17.7778 | 160 | 0.6495 | 0.5405 | 0.6495 | 0.8059 |
| No log | 18.0 | 162 | 0.6351 | 0.5759 | 0.6351 | 0.7969 |
| No log | 18.2222 | 164 | 0.6688 | 0.6167 | 0.6688 | 0.8178 |
| No log | 18.4444 | 166 | 0.7520 | 0.6010 | 0.7520 | 0.8672 |
| No log | 18.6667 | 168 | 0.7454 | 0.6218 | 0.7454 | 0.8634 |
| No log | 18.8889 | 170 | 0.6268 | 0.6764 | 0.6268 | 0.7917 |
| No log | 19.1111 | 172 | 0.6768 | 0.5782 | 0.6768 | 0.8227 |
| No log | 19.3333 | 174 | 0.7543 | 0.5565 | 0.7543 | 0.8685 |
| No log | 19.5556 | 176 | 0.7019 | 0.5353 | 0.7019 | 0.8378 |
| No log | 19.7778 | 178 | 0.6055 | 0.5656 | 0.6055 | 0.7781 |
| No log | 20.0 | 180 | 0.6742 | 0.6032 | 0.6742 | 0.8211 |
| No log | 20.2222 | 182 | 0.7125 | 0.6160 | 0.7125 | 0.8441 |
| No log | 20.4444 | 184 | 0.6222 | 0.5975 | 0.6222 | 0.7888 |
| No log | 20.6667 | 186 | 0.5399 | 0.6623 | 0.5399 | 0.7348 |
| No log | 20.8889 | 188 | 0.5376 | 0.7007 | 0.5376 | 0.7332 |
| No log | 21.1111 | 190 | 0.5287 | 0.7285 | 0.5287 | 0.7271 |
| No log | 21.3333 | 192 | 0.5648 | 0.6510 | 0.5648 | 0.7515 |
| No log | 21.5556 | 194 | 0.6838 | 0.5686 | 0.6838 | 0.8270 |
| No log | 21.7778 | 196 | 0.7340 | 0.5686 | 0.7340 | 0.8567 |
| No log | 22.0 | 198 | 0.6803 | 0.6127 | 0.6803 | 0.8248 |
| No log | 22.2222 | 200 | 0.6386 | 0.5774 | 0.6386 | 0.7991 |
| No log | 22.4444 | 202 | 0.6358 | 0.5396 | 0.6358 | 0.7974 |
| No log | 22.6667 | 204 | 0.6154 | 0.6001 | 0.6154 | 0.7845 |
| No log | 22.8889 | 206 | 0.6097 | 0.6219 | 0.6097 | 0.7808 |
| No log | 23.1111 | 208 | 0.6457 | 0.6188 | 0.6457 | 0.8035 |
| No log | 23.3333 | 210 | 0.7176 | 0.5675 | 0.7176 | 0.8471 |
| No log | 23.5556 | 212 | 0.7103 | 0.5811 | 0.7103 | 0.8428 |
| No log | 23.7778 | 214 | 0.6602 | 0.6101 | 0.6602 | 0.8125 |
| No log | 24.0 | 216 | 0.6328 | 0.6199 | 0.6328 | 0.7955 |
| No log | 24.2222 | 218 | 0.6270 | 0.6396 | 0.6270 | 0.7918 |
| No log | 24.4444 | 220 | 0.6488 | 0.6137 | 0.6488 | 0.8055 |
| No log | 24.6667 | 222 | 0.7151 | 0.6004 | 0.7151 | 0.8457 |
| No log | 24.8889 | 224 | 0.7710 | 0.5670 | 0.7710 | 0.8780 |
| No log | 25.1111 | 226 | 0.7801 | 0.5370 | 0.7801 | 0.8832 |
| No log | 25.3333 | 228 | 0.6991 | 0.6630 | 0.6991 | 0.8361 |
| No log | 25.5556 | 230 | 0.6230 | 0.6405 | 0.6230 | 0.7893 |
| No log | 25.7778 | 232 | 0.5948 | 0.6196 | 0.5948 | 0.7712 |
| No log | 26.0 | 234 | 0.5947 | 0.6317 | 0.5947 | 0.7712 |
| No log | 26.2222 | 236 | 0.6046 | 0.6327 | 0.6046 | 0.7776 |
| No log | 26.4444 | 238 | 0.6132 | 0.6122 | 0.6132 | 0.7831 |
| No log | 26.6667 | 240 | 0.6448 | 0.6386 | 0.6448 | 0.8030 |
| No log | 26.8889 | 242 | 0.6935 | 0.5988 | 0.6935 | 0.8328 |
| No log | 27.1111 | 244 | 0.7342 | 0.5788 | 0.7342 | 0.8569 |
| No log | 27.3333 | 246 | 0.6795 | 0.5788 | 0.6795 | 0.8243 |
| No log | 27.5556 | 248 | 0.6065 | 0.6102 | 0.6065 | 0.7788 |
| No log | 27.7778 | 250 | 0.5949 | 0.6335 | 0.5949 | 0.7713 |
| No log | 28.0 | 252 | 0.5957 | 0.6335 | 0.5957 | 0.7718 |
| No log | 28.2222 | 254 | 0.6138 | 0.5894 | 0.6138 | 0.7834 |
| No log | 28.4444 | 256 | 0.6674 | 0.5846 | 0.6674 | 0.8170 |
| No log | 28.6667 | 258 | 0.7303 | 0.5675 | 0.7303 | 0.8546 |
| No log | 28.8889 | 260 | 0.6962 | 0.5916 | 0.6962 | 0.8344 |
| No log | 29.1111 | 262 | 0.6412 | 0.6188 | 0.6412 | 0.8007 |
| No log | 29.3333 | 264 | 0.6326 | 0.6188 | 0.6326 | 0.7954 |
| No log | 29.5556 | 266 | 0.6641 | 0.6300 | 0.6641 | 0.8149 |
| No log | 29.7778 | 268 | 0.6605 | 0.6300 | 0.6605 | 0.8127 |
| No log | 30.0 | 270 | 0.5866 | 0.6377 | 0.5866 | 0.7659 |
| No log | 30.2222 | 272 | 0.5532 | 0.6623 | 0.5532 | 0.7437 |
| No log | 30.4444 | 274 | 0.5552 | 0.6422 | 0.5552 | 0.7451 |
| No log | 30.6667 | 276 | 0.5588 | 0.6422 | 0.5588 | 0.7476 |
| No log | 30.8889 | 278 | 0.5433 | 0.6664 | 0.5433 | 0.7371 |
| No log | 31.1111 | 280 | 0.5501 | 0.6871 | 0.5501 | 0.7417 |
| No log | 31.3333 | 282 | 0.5536 | 0.6871 | 0.5536 | 0.7440 |
| No log | 31.5556 | 284 | 0.5483 | 0.6539 | 0.5483 | 0.7405 |
| No log | 31.7778 | 286 | 0.5432 | 0.6655 | 0.5432 | 0.7370 |
| No log | 32.0 | 288 | 0.5403 | 0.6655 | 0.5403 | 0.7350 |
| No log | 32.2222 | 290 | 0.5698 | 0.6558 | 0.5698 | 0.7549 |
| No log | 32.4444 | 292 | 0.5924 | 0.6670 | 0.5924 | 0.7697 |
| No log | 32.6667 | 294 | 0.5945 | 0.6199 | 0.5945 | 0.7710 |
| No log | 32.8889 | 296 | 0.6410 | 0.5846 | 0.6410 | 0.8006 |
| No log | 33.1111 | 298 | 0.6382 | 0.5846 | 0.6382 | 0.7989 |
| No log | 33.3333 | 300 | 0.6684 | 0.5509 | 0.6684 | 0.8176 |
| No log | 33.5556 | 302 | 0.7163 | 0.5618 | 0.7163 | 0.8464 |
| No log | 33.7778 | 304 | 0.6891 | 0.5527 | 0.6891 | 0.8301 |
| No log | 34.0 | 306 | 0.6585 | 0.5527 | 0.6585 | 0.8115 |
| No log | 34.2222 | 308 | 0.6238 | 0.6003 | 0.6238 | 0.7898 |
| No log | 34.4444 | 310 | 0.6269 | 0.6167 | 0.6269 | 0.7918 |
| No log | 34.6667 | 312 | 0.6197 | 0.6282 | 0.6197 | 0.7872 |
| No log | 34.8889 | 314 | 0.6166 | 0.6282 | 0.6166 | 0.7853 |
| No log | 35.1111 | 316 | 0.6293 | 0.6167 | 0.6293 | 0.7933 |
| No log | 35.3333 | 318 | 0.6352 | 0.6167 | 0.6352 | 0.7970 |
| No log | 35.5556 | 320 | 0.6814 | 0.5487 | 0.6814 | 0.8255 |
| No log | 35.7778 | 322 | 0.6967 | 0.5370 | 0.6967 | 0.8347 |
| No log | 36.0 | 324 | 0.6642 | 0.5846 | 0.6642 | 0.8150 |
| No log | 36.2222 | 326 | 0.6327 | 0.5964 | 0.6327 | 0.7954 |
| No log | 36.4444 | 328 | 0.6084 | 0.6177 | 0.6084 | 0.7800 |
| No log | 36.6667 | 330 | 0.6354 | 0.6218 | 0.6354 | 0.7971 |
| No log | 36.8889 | 332 | 0.6874 | 0.5526 | 0.6874 | 0.8291 |
| No log | 37.1111 | 334 | 0.6710 | 0.5636 | 0.6710 | 0.8192 |
| No log | 37.3333 | 336 | 0.6415 | 0.5912 | 0.6415 | 0.8009 |
| No log | 37.5556 | 338 | 0.5892 | 0.6291 | 0.5892 | 0.7676 |
| No log | 37.7778 | 340 | 0.5807 | 0.6291 | 0.5807 | 0.7621 |
| No log | 38.0 | 342 | 0.6052 | 0.6177 | 0.6052 | 0.7779 |
| No log | 38.2222 | 344 | 0.6385 | 0.5948 | 0.6385 | 0.7991 |
| No log | 38.4444 | 346 | 0.6742 | 0.5888 | 0.6742 | 0.8211 |
| No log | 38.6667 | 348 | 0.6698 | 0.5998 | 0.6698 | 0.8184 |
| No log | 38.8889 | 350 | 0.6627 | 0.6053 | 0.6627 | 0.8141 |
| No log | 39.1111 | 352 | 0.6380 | 0.6092 | 0.6380 | 0.7987 |
| No log | 39.3333 | 354 | 0.6397 | 0.5975 | 0.6397 | 0.7998 |
| No log | 39.5556 | 356 | 0.6297 | 0.6706 | 0.6297 | 0.7935 |
| No log | 39.7778 | 358 | 0.6324 | 0.6276 | 0.6324 | 0.7952 |
| No log | 40.0 | 360 | 0.6509 | 0.6021 | 0.6509 | 0.8068 |
| No log | 40.2222 | 362 | 0.6739 | 0.5846 | 0.6739 | 0.8209 |
| No log | 40.4444 | 364 | 0.6604 | 0.5718 | 0.6604 | 0.8126 |
| No log | 40.6667 | 366 | 0.6588 | 0.5833 | 0.6588 | 0.8117 |
| No log | 40.8889 | 368 | 0.6849 | 0.5718 | 0.6849 | 0.8276 |
| No log | 41.1111 | 370 | 0.6923 | 0.5718 | 0.6923 | 0.8320 |
| No log | 41.3333 | 372 | 0.7258 | 0.5591 | 0.7258 | 0.8519 |
| No log | 41.5556 | 374 | 0.7379 | 0.5591 | 0.7379 | 0.8590 |
| No log | 41.7778 | 376 | 0.7001 | 0.5998 | 0.7001 | 0.8367 |
| No log | 42.0 | 378 | 0.6477 | 0.6035 | 0.6477 | 0.8048 |
| No log | 42.2222 | 380 | 0.6419 | 0.6432 | 0.6419 | 0.8012 |
| No log | 42.4444 | 382 | 0.6697 | 0.5810 | 0.6697 | 0.8183 |
| No log | 42.6667 | 384 | 0.7233 | 0.5385 | 0.7233 | 0.8504 |
| No log | 42.8889 | 386 | 0.7501 | 0.5372 | 0.7501 | 0.8661 |
| No log | 43.1111 | 388 | 0.7076 | 0.5385 | 0.7076 | 0.8412 |
| No log | 43.3333 | 390 | 0.7023 | 0.5385 | 0.7023 | 0.8380 |
| No log | 43.5556 | 392 | 0.6814 | 0.5602 | 0.6814 | 0.8255 |
| No log | 43.7778 | 394 | 0.6457 | 0.6035 | 0.6457 | 0.8036 |
| No log | 44.0 | 396 | 0.6403 | 0.6035 | 0.6403 | 0.8002 |
| No log | 44.2222 | 398 | 0.6279 | 0.6035 | 0.6279 | 0.7924 |
| No log | 44.4444 | 400 | 0.5960 | 0.6291 | 0.5960 | 0.7720 |
| No log | 44.6667 | 402 | 0.5902 | 0.6291 | 0.5902 | 0.7683 |
| No log | 44.8889 | 404 | 0.6019 | 0.6291 | 0.6019 | 0.7758 |
| No log | 45.1111 | 406 | 0.6352 | 0.6035 | 0.6352 | 0.7970 |
| No log | 45.3333 | 408 | 0.6597 | 0.5718 | 0.6597 | 0.8122 |
| No log | 45.5556 | 410 | 0.6939 | 0.5902 | 0.6939 | 0.8330 |
| No log | 45.7778 | 412 | 0.6737 | 0.5707 | 0.6737 | 0.8208 |
| No log | 46.0 | 414 | 0.6399 | 0.5844 | 0.6399 | 0.8000 |
| No log | 46.2222 | 416 | 0.6362 | 0.6092 | 0.6362 | 0.7976 |
| No log | 46.4444 | 418 | 0.6416 | 0.5844 | 0.6416 | 0.8010 |
| No log | 46.6667 | 420 | 0.6589 | 0.5822 | 0.6589 | 0.8118 |
| No log | 46.8889 | 422 | 0.6679 | 0.5707 | 0.6679 | 0.8172 |
| No log | 47.1111 | 424 | 0.6562 | 0.5718 | 0.6562 | 0.8101 |
| No log | 47.3333 | 426 | 0.6473 | 0.6167 | 0.6473 | 0.8046 |
| No log | 47.5556 | 428 | 0.6391 | 0.6282 | 0.6391 | 0.7994 |
| No log | 47.7778 | 430 | 0.6357 | 0.6209 | 0.6357 | 0.7973 |
| No log | 48.0 | 432 | 0.6412 | 0.5597 | 0.6412 | 0.8008 |
| No log | 48.2222 | 434 | 0.6525 | 0.6199 | 0.6525 | 0.8078 |
| No log | 48.4444 | 436 | 0.6689 | 0.6053 | 0.6689 | 0.8179 |
| No log | 48.6667 | 438 | 0.7018 | 0.5835 | 0.7018 | 0.8377 |
| No log | 48.8889 | 440 | 0.7090 | 0.5835 | 0.7090 | 0.8420 |
| No log | 49.1111 | 442 | 0.7035 | 0.6042 | 0.7035 | 0.8387 |
| No log | 49.3333 | 444 | 0.6771 | 0.5938 | 0.6771 | 0.8228 |
| No log | 49.5556 | 446 | 0.6462 | 0.6167 | 0.6462 | 0.8039 |
| No log | 49.7778 | 448 | 0.6219 | 0.6405 | 0.6219 | 0.7886 |
| No log | 50.0 | 450 | 0.6160 | 0.6405 | 0.6160 | 0.7848 |
| No log | 50.2222 | 452 | 0.6303 | 0.6396 | 0.6303 | 0.7939 |
| No log | 50.4444 | 454 | 0.6560 | 0.5810 | 0.6560 | 0.8099 |
| No log | 50.6667 | 456 | 0.6540 | 0.6053 | 0.6540 | 0.8087 |
| No log | 50.8889 | 458 | 0.6662 | 0.5810 | 0.6662 | 0.8162 |
| No log | 51.1111 | 460 | 0.6511 | 0.5923 | 0.6511 | 0.8069 |
| No log | 51.3333 | 462 | 0.6200 | 0.6282 | 0.6200 | 0.7874 |
| No log | 51.5556 | 464 | 0.6172 | 0.6282 | 0.6172 | 0.7856 |
| No log | 51.7778 | 466 | 0.6256 | 0.5923 | 0.6256 | 0.7910 |
| No log | 52.0 | 468 | 0.6356 | 0.5923 | 0.6356 | 0.7972 |
| No log | 52.2222 | 470 | 0.6644 | 0.5923 | 0.6644 | 0.8151 |
| No log | 52.4444 | 472 | 0.6641 | 0.5810 | 0.6641 | 0.8149 |
| No log | 52.6667 | 474 | 0.6557 | 0.5602 | 0.6557 | 0.8097 |
| No log | 52.8889 | 476 | 0.6568 | 0.5740 | 0.6568 | 0.8104 |
| No log | 53.1111 | 478 | 0.6603 | 0.5663 | 0.6603 | 0.8126 |
| No log | 53.3333 | 480 | 0.6774 | 0.5825 | 0.6774 | 0.8230 |
| No log | 53.5556 | 482 | 0.7109 | 0.5651 | 0.7109 | 0.8431 |
| No log | 53.7778 | 484 | 0.7478 | 0.5027 | 0.7478 | 0.8647 |
| No log | 54.0 | 486 | 0.7526 | 0.4907 | 0.7526 | 0.8675 |
| No log | 54.2222 | 488 | 0.7258 | 0.5147 | 0.7258 | 0.8519 |
| No log | 54.4444 | 490 | 0.6949 | 0.5385 | 0.6949 | 0.8336 |
| No log | 54.6667 | 492 | 0.7012 | 0.5385 | 0.7012 | 0.8374 |
| No log | 54.8889 | 494 | 0.6973 | 0.5385 | 0.6973 | 0.8351 |
| No log | 55.1111 | 496 | 0.6637 | 0.5718 | 0.6637 | 0.8147 |
| No log | 55.3333 | 498 | 0.6491 | 0.6082 | 0.6491 | 0.8056 |
| 0.2007 | 55.5556 | 500 | 0.6663 | 0.5630 | 0.6663 | 0.8163 |
| 0.2007 | 55.7778 | 502 | 0.6881 | 0.5630 | 0.6881 | 0.8295 |
| 0.2007 | 56.0 | 504 | 0.7443 | 0.4902 | 0.7443 | 0.8627 |
| 0.2007 | 56.2222 | 506 | 0.7786 | 0.4902 | 0.7786 | 0.8824 |
| 0.2007 | 56.4444 | 508 | 0.7601 | 0.5128 | 0.7601 | 0.8718 |
| 0.2007 | 56.6667 | 510 | 0.7457 | 0.5372 | 0.7457 | 0.8636 |

Framework versions

  • Transformers 4.44.2
  • PyTorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1