ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k1_task1_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1368
  • Qwk: 0.5484
  • Mse: 1.1368
  • Rmse: 1.0662

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.4 2 7.6299 -0.0308 7.6299 2.7622
No log 0.8 4 4.5417 0.0794 4.5417 2.1311
No log 1.2 6 3.4844 -0.0105 3.4844 1.8666
No log 1.6 8 3.0867 -0.0377 3.0867 1.7569
No log 2.0 10 2.1969 0.0645 2.1969 1.4822
No log 2.4 12 1.8065 0.1964 1.8065 1.3440
No log 2.8 14 1.8786 0.2051 1.8786 1.3706
No log 3.2 16 2.2414 0.1045 2.2414 1.4971
No log 3.6 18 2.2402 0.0735 2.2402 1.4967
No log 4.0 20 1.8247 0.2857 1.8247 1.3508
No log 4.4 22 1.7941 0.2812 1.7941 1.3394
No log 4.8 24 1.5420 0.3607 1.5420 1.2418
No log 5.2 26 1.4241 0.4098 1.4241 1.1933
No log 5.6 28 1.3183 0.4426 1.3183 1.1482
No log 6.0 30 1.3400 0.5333 1.3400 1.1576
No log 6.4 32 1.1850 0.5303 1.1850 1.0886
No log 6.8 34 1.1024 0.5455 1.1024 1.0499
No log 7.2 36 1.1395 0.5652 1.1395 1.0675
No log 7.6 38 1.2358 0.6069 1.2358 1.1116
No log 8.0 40 1.1045 0.5333 1.1045 1.0510
No log 8.4 42 1.0692 0.5522 1.0692 1.0340
No log 8.8 44 1.0970 0.5414 1.0970 1.0474
No log 9.2 46 1.2794 0.5113 1.2794 1.1311
No log 9.6 48 1.4546 0.5270 1.4546 1.2061
No log 10.0 50 1.2370 0.5373 1.2370 1.1122
No log 10.4 52 1.1410 0.5 1.1410 1.0682
No log 10.8 54 1.1533 0.5224 1.1533 1.0739
No log 11.2 56 1.1339 0.4848 1.1339 1.0649
No log 11.6 58 1.1543 0.4688 1.1543 1.0744
No log 12.0 60 1.1744 0.5263 1.1744 1.0837
No log 12.4 62 1.3707 0.4308 1.3707 1.1708
No log 12.8 64 1.2656 0.5231 1.2656 1.1250
No log 13.2 66 1.2280 0.4375 1.2280 1.1082
No log 13.6 68 1.2360 0.4375 1.2360 1.1117
No log 14.0 70 1.2731 0.5271 1.2731 1.1283
No log 14.4 72 1.3545 0.4148 1.3545 1.1638
No log 14.8 74 1.2019 0.5224 1.2019 1.0963
No log 15.2 76 1.1950 0.5522 1.1950 1.0932
No log 15.6 78 1.2492 0.4741 1.2492 1.1177
No log 16.0 80 1.3875 0.4697 1.3875 1.1779
No log 16.4 82 1.3081 0.4923 1.3081 1.1437
No log 16.8 84 1.2797 0.4961 1.2797 1.1312
No log 17.2 86 1.1818 0.5344 1.1818 1.0871
No log 17.6 88 1.2289 0.5197 1.2289 1.1086
No log 18.0 90 1.3600 0.5041 1.3600 1.1662
No log 18.4 92 1.3818 0.5116 1.3818 1.1755
No log 18.8 94 1.3807 0.4769 1.3807 1.1750
No log 19.2 96 1.2215 0.5397 1.2215 1.1052
No log 19.6 98 1.1603 0.4961 1.1603 1.0772
No log 20.0 100 1.1239 0.5354 1.1239 1.0602
No log 20.4 102 1.1364 0.4640 1.1364 1.0660
No log 20.8 104 1.1399 0.4567 1.1399 1.0677
No log 21.2 106 1.1417 0.4885 1.1417 1.0685
No log 21.6 108 1.1118 0.4885 1.1118 1.0544
No log 22.0 110 1.0636 0.4961 1.0636 1.0313
No log 22.4 112 1.1567 0.5581 1.1567 1.0755
No log 22.8 114 1.2047 0.5469 1.2047 1.0976
No log 23.2 116 1.1542 0.5 1.1542 1.0743
No log 23.6 118 1.1580 0.4553 1.1580 1.0761
No log 24.0 120 1.1721 0.5238 1.1721 1.0826
No log 24.4 122 1.1728 0.5625 1.1728 1.0830
No log 24.8 124 1.2093 0.5469 1.2093 1.0997
No log 25.2 126 1.1691 0.5625 1.1691 1.0812
No log 25.6 128 1.1468 0.5469 1.1468 1.0709
No log 26.0 130 1.1117 0.4882 1.1117 1.0544
No log 26.4 132 1.0955 0.5426 1.0955 1.0466
No log 26.8 134 1.1290 0.5238 1.1290 1.0625
No log 27.2 136 1.1819 0.5354 1.1819 1.0872
No log 27.6 138 1.1054 0.5238 1.1054 1.0514
No log 28.0 140 1.0636 0.5397 1.0636 1.0313
No log 28.4 142 1.0720 0.5 1.0720 1.0354
No log 28.8 144 1.1160 0.4754 1.1160 1.0564
No log 29.2 146 1.1377 0.4202 1.1377 1.0666
No log 29.6 148 1.1250 0.4333 1.1250 1.0607
No log 30.0 150 1.1197 0.4793 1.1197 1.0582
No log 30.4 152 1.0749 0.4918 1.0749 1.0368
No log 30.8 154 1.0581 0.5041 1.0581 1.0286
No log 31.2 156 1.1091 0.5625 1.1091 1.0532
No log 31.6 158 1.2786 0.5231 1.2786 1.1307
No log 32.0 160 1.2538 0.5116 1.2538 1.1197
No log 32.4 162 1.0934 0.5736 1.0934 1.0457
No log 32.8 164 1.0420 0.5581 1.0420 1.0208
No log 33.2 166 1.0614 0.5312 1.0614 1.0303
No log 33.6 168 1.0562 0.5512 1.0562 1.0277
No log 34.0 170 1.0927 0.5714 1.0927 1.0453
No log 34.4 172 1.1364 0.512 1.1364 1.0660
No log 34.8 174 1.0691 0.5669 1.0691 1.0340
No log 35.2 176 1.0075 0.5669 1.0075 1.0038
No log 35.6 178 0.9728 0.6061 0.9728 0.9863
No log 36.0 180 0.9568 0.6269 0.9568 0.9782
No log 36.4 182 0.9734 0.5538 0.9734 0.9866
No log 36.8 184 1.0259 0.5426 1.0259 1.0128
No log 37.2 186 1.0236 0.5538 1.0236 1.0117
No log 37.6 188 1.0035 0.6154 1.0035 1.0017
No log 38.0 190 1.0252 0.6260 1.0252 1.0125
No log 38.4 192 1.0558 0.5512 1.0558 1.0275
No log 38.8 194 1.0664 0.5440 1.0664 1.0326
No log 39.2 196 1.0971 0.5714 1.0971 1.0474
No log 39.6 198 1.1888 0.5512 1.1888 1.0903
No log 40.0 200 1.1954 0.5512 1.1954 1.0933
No log 40.4 202 1.0980 0.5354 1.0980 1.0479
No log 40.8 204 0.9938 0.6260 0.9938 0.9969
No log 41.2 206 0.9711 0.6165 0.9711 0.9854
No log 41.6 208 0.9885 0.6212 0.9885 0.9942
No log 42.0 210 0.9978 0.6212 0.9978 0.9989
No log 42.4 212 0.9866 0.6260 0.9866 0.9933
No log 42.8 214 1.0081 0.6061 1.0081 1.0040
No log 43.2 216 0.9905 0.5802 0.9905 0.9952
No log 43.6 218 0.9657 0.6119 0.9657 0.9827
No log 44.0 220 0.9944 0.6119 0.9944 0.9972
No log 44.4 222 1.0528 0.5909 1.0528 1.0261
No log 44.8 224 1.0753 0.5846 1.0753 1.0370
No log 45.2 226 1.0947 0.5469 1.0947 1.0463
No log 45.6 228 1.0953 0.5469 1.0953 1.0466
No log 46.0 230 1.0528 0.5909 1.0528 1.0261
No log 46.4 232 1.0451 0.5649 1.0451 1.0223
No log 46.8 234 1.0476 0.5649 1.0476 1.0235
No log 47.2 236 1.0585 0.5802 1.0585 1.0288
No log 47.6 238 1.0727 0.5426 1.0727 1.0357
No log 48.0 240 1.0966 0.5469 1.0966 1.0472
No log 48.4 242 1.1384 0.5397 1.1384 1.0669
No log 48.8 244 1.1205 0.5312 1.1205 1.0585
No log 49.2 246 1.0638 0.5692 1.0638 1.0314
No log 49.6 248 1.0738 0.5271 1.0738 1.0363
No log 50.0 250 1.1201 0.5039 1.1201 1.0583
No log 50.4 252 1.1607 0.496 1.1607 1.0773
No log 50.8 254 1.2012 0.4878 1.2012 1.0960
No log 51.2 256 1.2007 0.5041 1.2007 1.0957
No log 51.6 258 1.1662 0.48 1.1662 1.0799
No log 52.0 260 1.1422 0.5197 1.1422 1.0687
No log 52.4 262 1.1501 0.5197 1.1501 1.0724
No log 52.8 264 1.1722 0.5 1.1722 1.0827
No log 53.2 266 1.2040 0.5041 1.2040 1.0973
No log 53.6 268 1.2185 0.5203 1.2185 1.1039
No log 54.0 270 1.1990 0.5484 1.1990 1.0950
No log 54.4 272 1.1722 0.5041 1.1722 1.0827
No log 54.8 274 1.1434 0.5161 1.1434 1.0693
No log 55.2 276 1.1207 0.528 1.1207 1.0586
No log 55.6 278 1.1232 0.5161 1.1232 1.0598
No log 56.0 280 1.1546 0.56 1.1546 1.0745
No log 56.4 282 1.1648 0.5484 1.1648 1.0793
No log 56.8 284 1.1770 0.5484 1.1770 1.0849
No log 57.2 286 1.1675 0.5323 1.1675 1.0805
No log 57.6 288 1.1580 0.5161 1.1580 1.0761
No log 58.0 290 1.1637 0.5161 1.1637 1.0788
No log 58.4 292 1.1599 0.5161 1.1599 1.0770
No log 58.8 294 1.1580 0.5161 1.1580 1.0761
No log 59.2 296 1.1478 0.496 1.1478 1.0713
No log 59.6 298 1.1495 0.48 1.1495 1.0722
No log 60.0 300 1.1196 0.4921 1.1196 1.0581
No log 60.4 302 1.1001 0.4882 1.1001 1.0488
No log 60.8 304 1.1166 0.544 1.1166 1.0567
No log 61.2 306 1.1133 0.544 1.1133 1.0551
No log 61.6 308 1.0972 0.4882 1.0972 1.0475
No log 62.0 310 1.0927 0.5 1.0927 1.0453
No log 62.4 312 1.1087 0.5606 1.1087 1.0530
No log 62.8 314 1.1389 0.5312 1.1389 1.0672
No log 63.2 316 1.1474 0.5039 1.1474 1.0712
No log 63.6 318 1.1199 0.5344 1.1199 1.0582
No log 64.0 320 1.1092 0.5039 1.1092 1.0532
No log 64.4 322 1.1324 0.5625 1.1324 1.0641
No log 64.8 324 1.1765 0.56 1.1765 1.0847
No log 65.2 326 1.2027 0.5397 1.2027 1.0967
No log 65.6 328 1.1865 0.5161 1.1865 1.0893
No log 66.0 330 1.1780 0.4921 1.1780 1.0854
No log 66.4 332 1.1914 0.4677 1.1914 1.0915
No log 66.8 334 1.1969 0.48 1.1969 1.0940
No log 67.2 336 1.1870 0.4677 1.1870 1.0895
No log 67.6 338 1.1588 0.4921 1.1588 1.0765
No log 68.0 340 1.1606 0.496 1.1606 1.0773
No log 68.4 342 1.1742 0.5397 1.1742 1.0836
No log 68.8 344 1.1865 0.528 1.1865 1.0893
No log 69.2 346 1.1879 0.5397 1.1879 1.0899
No log 69.6 348 1.1924 0.5397 1.1924 1.0920
No log 70.0 350 1.1919 0.56 1.1919 1.0917
No log 70.4 352 1.1970 0.512 1.1970 1.0941
No log 70.8 354 1.1868 0.496 1.1868 1.0894
No log 71.2 356 1.1827 0.496 1.1827 1.0875
No log 71.6 358 1.1981 0.496 1.1981 1.0946
No log 72.0 360 1.2145 0.5 1.2145 1.1021
No log 72.4 362 1.2288 0.496 1.2288 1.1085
No log 72.8 364 1.2357 0.4754 1.2357 1.1116
No log 73.2 366 1.2386 0.4754 1.2386 1.1129
No log 73.6 368 1.2328 0.4878 1.2328 1.1103
No log 74.0 370 1.2323 0.5082 1.2323 1.1101
No log 74.4 372 1.2228 0.4959 1.2228 1.1058
No log 74.8 374 1.1984 0.4878 1.1984 1.0947
No log 75.2 376 1.1858 0.4878 1.1858 1.0889
No log 75.6 378 1.1817 0.5082 1.1817 1.0871
No log 76.0 380 1.1660 0.4878 1.1660 1.0798
No log 76.4 382 1.1362 0.496 1.1362 1.0659
No log 76.8 384 1.1118 0.5238 1.1118 1.0544
No log 77.2 386 1.0982 0.5512 1.0982 1.0479
No log 77.6 388 1.0972 0.5469 1.0972 1.0475
No log 78.0 390 1.1058 0.56 1.1058 1.0516
No log 78.4 392 1.1307 0.5397 1.1307 1.0633
No log 78.8 394 1.1477 0.528 1.1477 1.0713
No log 79.2 396 1.1517 0.528 1.1517 1.0732
No log 79.6 398 1.1411 0.528 1.1411 1.0682
No log 80.0 400 1.1307 0.5484 1.1307 1.0634
No log 80.4 402 1.1135 0.56 1.1135 1.0552
No log 80.8 404 1.0987 0.5645 1.0987 1.0482
No log 81.2 406 1.1006 0.5440 1.1006 1.0491
No log 81.6 408 1.1120 0.5161 1.1120 1.0545
No log 82.0 410 1.1250 0.5161 1.1250 1.0606
No log 82.4 412 1.1373 0.5203 1.1373 1.0664
No log 82.8 414 1.1380 0.5203 1.1380 1.0668
No log 83.2 416 1.1347 0.496 1.1347 1.0652
No log 83.6 418 1.1350 0.496 1.1350 1.0654
No log 84.0 420 1.1352 0.496 1.1352 1.0655
No log 84.4 422 1.1306 0.496 1.1306 1.0633
No log 84.8 424 1.1228 0.496 1.1228 1.0596
No log 85.2 426 1.1173 0.5238 1.1173 1.0570
No log 85.6 428 1.1138 0.5238 1.1138 1.0554
No log 86.0 430 1.1150 0.5238 1.1150 1.0559
No log 86.4 432 1.1202 0.5238 1.1202 1.0584
No log 86.8 434 1.1241 0.5440 1.1241 1.0602
No log 87.2 436 1.1263 0.5238 1.1263 1.0613
No log 87.6 438 1.1289 0.5238 1.1289 1.0625
No log 88.0 440 1.1343 0.5323 1.1343 1.0651
No log 88.4 442 1.1375 0.5645 1.1375 1.0665
No log 88.8 444 1.1377 0.5645 1.1377 1.0666
No log 89.2 446 1.1404 0.5246 1.1404 1.0679
No log 89.6 448 1.1455 0.5246 1.1455 1.0703
No log 90.0 450 1.1462 0.5246 1.1462 1.0706
No log 90.4 452 1.1450 0.5246 1.1450 1.0701
No log 90.8 454 1.1425 0.5246 1.1425 1.0689
No log 91.2 456 1.1408 0.5246 1.1408 1.0681
No log 91.6 458 1.1370 0.5203 1.1370 1.0663
No log 92.0 460 1.1353 0.5203 1.1353 1.0655
No log 92.4 462 1.1371 0.5203 1.1371 1.0663
No log 92.8 464 1.1376 0.5203 1.1376 1.0666
No log 93.2 466 1.1379 0.5203 1.1379 1.0667
No log 93.6 468 1.1356 0.5203 1.1356 1.0656
No log 94.0 470 1.1337 0.5484 1.1337 1.0648
No log 94.4 472 1.1323 0.5203 1.1323 1.0641
No log 94.8 474 1.1307 0.5203 1.1307 1.0633
No log 95.2 476 1.1314 0.5203 1.1314 1.0637
No log 95.6 478 1.1340 0.5484 1.1340 1.0649
No log 96.0 480 1.1346 0.5484 1.1346 1.0652
No log 96.4 482 1.1348 0.5484 1.1348 1.0653
No log 96.8 484 1.1368 0.5484 1.1368 1.0662
No log 97.2 486 1.1382 0.5484 1.1382 1.0669
No log 97.6 488 1.1394 0.5484 1.1394 1.0674
No log 98.0 490 1.1396 0.5484 1.1396 1.0675
No log 98.4 492 1.1387 0.5484 1.1387 1.0671
No log 98.8 494 1.1379 0.5484 1.1379 1.0667
No log 99.2 496 1.1374 0.5484 1.1374 1.0665
No log 99.6 498 1.1370 0.5484 1.1370 1.0663
0.2257 100.0 500 1.1368 0.5484 1.1368 1.0662

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k1_task1_organization

Finetuned
(4019)
this model