ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k14_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2571
  • Qwk: 0.1247
  • Mse: 1.2571
  • Rmse: 1.1212

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0488 2 4.7286 -0.0020 4.7286 2.1745
No log 0.0976 4 2.6619 0.0159 2.6619 1.6315
No log 0.1463 6 1.8540 -0.0017 1.8540 1.3616
No log 0.1951 8 2.3466 -0.0899 2.3466 1.5319
No log 0.2439 10 1.8112 -0.1104 1.8112 1.3458
No log 0.2927 12 1.5655 -0.0441 1.5655 1.2512
No log 0.3415 14 1.4131 0.0438 1.4131 1.1887
No log 0.3902 16 1.3104 0.0889 1.3104 1.1447
No log 0.4390 18 1.3648 -0.0407 1.3648 1.1683
No log 0.4878 20 1.5208 -0.0441 1.5208 1.2332
No log 0.5366 22 1.7310 -0.0441 1.7310 1.3157
No log 0.5854 24 1.8733 0.0227 1.8733 1.3687
No log 0.6341 26 1.7635 0.0082 1.7635 1.3280
No log 0.6829 28 1.4427 0.0297 1.4427 1.2011
No log 0.7317 30 1.2255 0.1570 1.2255 1.1070
No log 0.7805 32 1.2079 0.1009 1.2079 1.0991
No log 0.8293 34 1.2001 0.0941 1.2001 1.0955
No log 0.8780 36 1.2112 0.0527 1.2112 1.1005
No log 0.9268 38 1.2431 0.0527 1.2431 1.1150
No log 0.9756 40 1.2270 0.0499 1.2270 1.1077
No log 1.0244 42 1.2162 0.0180 1.2162 1.1028
No log 1.0732 44 1.2030 0.0810 1.2030 1.0968
No log 1.1220 46 1.1763 0.0642 1.1763 1.0846
No log 1.1707 48 1.1521 0.1278 1.1521 1.0734
No log 1.2195 50 1.1181 0.1773 1.1181 1.0574
No log 1.2683 52 1.1375 0.1507 1.1375 1.0666
No log 1.3171 54 1.1121 0.1711 1.1121 1.0546
No log 1.3659 56 1.0773 0.1649 1.0773 1.0379
No log 1.4146 58 1.1092 0.1148 1.1092 1.0532
No log 1.4634 60 1.2166 0.1188 1.2166 1.1030
No log 1.5122 62 1.2361 0.0692 1.2361 1.1118
No log 1.5610 64 1.1673 0.1507 1.1673 1.0804
No log 1.6098 66 1.1814 0.2485 1.1814 1.0869
No log 1.6585 68 1.2532 0.0432 1.2532 1.1195
No log 1.7073 70 1.2262 0.1168 1.2262 1.1073
No log 1.7561 72 1.1245 0.1696 1.1245 1.0604
No log 1.8049 74 1.0579 0.4116 1.0579 1.0285
No log 1.8537 76 1.0500 0.3408 1.0500 1.0247
No log 1.9024 78 1.1346 0.1958 1.1346 1.0652
No log 1.9512 80 1.3481 0.2099 1.3481 1.1611
No log 2.0 82 1.4895 0.3331 1.4895 1.2205
No log 2.0488 84 1.4847 0.3345 1.4847 1.2185
No log 2.0976 86 1.3033 0.2251 1.3033 1.1416
No log 2.1463 88 1.2492 0.2251 1.2492 1.1177
No log 2.1951 90 1.5188 0.3371 1.5188 1.2324
No log 2.2439 92 1.5892 0.3189 1.5892 1.2607
No log 2.2927 94 1.4744 0.3345 1.4744 1.2142
No log 2.3415 96 1.3962 0.3542 1.3962 1.1816
No log 2.3902 98 1.3869 0.3140 1.3869 1.1777
No log 2.4390 100 1.4793 0.3407 1.4793 1.2163
No log 2.4878 102 1.7856 0.2943 1.7856 1.3363
No log 2.5366 104 1.8342 0.2222 1.8342 1.3543
No log 2.5854 106 1.7025 0.2691 1.7025 1.3048
No log 2.6341 108 1.3838 0.3222 1.3838 1.1764
No log 2.6829 110 1.2841 0.3442 1.2841 1.1332
No log 2.7317 112 1.5093 0.2847 1.5093 1.2285
No log 2.7805 114 1.7802 0.2008 1.7802 1.3343
No log 2.8293 116 1.9692 0.2038 1.9692 1.4033
No log 2.8780 118 1.9013 0.2008 1.9013 1.3789
No log 2.9268 120 1.6029 0.2540 1.6029 1.2660
No log 2.9756 122 1.3051 0.3874 1.3051 1.1424
No log 3.0244 124 1.1771 0.2563 1.1771 1.0850
No log 3.0732 126 1.3332 0.3693 1.3332 1.1547
No log 3.1220 128 1.7713 0.1978 1.7713 1.3309
No log 3.1707 130 1.9234 0.0878 1.9234 1.3869
No log 3.2195 132 1.7836 0.1242 1.7836 1.3355
No log 3.2683 134 1.4600 0.3442 1.4600 1.2083
No log 3.3171 136 1.2832 0.3231 1.2832 1.1328
No log 3.3659 138 1.1054 0.1968 1.1054 1.0514
No log 3.4146 140 1.0721 0.2132 1.0721 1.0354
No log 3.4634 142 1.1512 0.2357 1.1512 1.0729
No log 3.5122 144 1.3318 0.2895 1.3318 1.1540
No log 3.5610 146 1.3353 0.2060 1.3353 1.1556
No log 3.6098 148 1.2792 0.1310 1.2792 1.1310
No log 3.6585 150 1.2802 0.1807 1.2802 1.1315
No log 3.7073 152 1.2648 0.2223 1.2648 1.1246
No log 3.7561 154 1.4640 0.2711 1.4640 1.2099
No log 3.8049 156 1.7420 0.2419 1.7420 1.3198
No log 3.8537 158 1.7047 0.2564 1.7047 1.3056
No log 3.9024 160 1.3902 0.3523 1.3902 1.1791
No log 3.9512 162 1.1262 0.2623 1.1262 1.0612
No log 4.0 164 1.0873 0.2850 1.0873 1.0427
No log 4.0488 166 1.0923 0.3111 1.0923 1.0451
No log 4.0976 168 1.1990 0.2037 1.1990 1.0950
No log 4.1463 170 1.5936 0.3642 1.5936 1.2624
No log 4.1951 172 1.7958 0.2469 1.7958 1.3401
No log 4.2439 174 1.6506 0.2887 1.6506 1.2848
No log 4.2927 176 1.3751 0.2275 1.3751 1.1727
No log 4.3415 178 1.1841 0.1911 1.1841 1.0882
No log 4.3902 180 1.1714 0.1602 1.1714 1.0823
No log 4.4390 182 1.2242 0.2291 1.2242 1.1064
No log 4.4878 184 1.3428 0.2520 1.3428 1.1588
No log 4.5366 186 1.5401 0.3358 1.5401 1.2410
No log 4.5854 188 1.5631 0.3189 1.5631 1.2502
No log 4.6341 190 1.3892 0.2929 1.3892 1.1786
No log 4.6829 192 1.2517 0.3512 1.2517 1.1188
No log 4.7317 194 1.2064 0.2833 1.2064 1.0983
No log 4.7805 196 1.1623 0.2084 1.1623 1.0781
No log 4.8293 198 1.2342 0.2544 1.2342 1.1109
No log 4.8780 200 1.4386 0.2929 1.4386 1.1994
No log 4.9268 202 1.6007 0.2847 1.6007 1.2652
No log 4.9756 204 1.6596 0.2644 1.6596 1.2882
No log 5.0244 206 1.5757 0.2378 1.5757 1.2553
No log 5.0732 208 1.3330 0.2617 1.3330 1.1545
No log 5.1220 210 1.0983 0.1602 1.0983 1.0480
No log 5.1707 212 1.0707 0.1977 1.0707 1.0347
No log 5.2195 214 1.1838 0.2141 1.1838 1.0880
No log 5.2683 216 1.4988 0.2105 1.4988 1.2242
No log 5.3171 218 1.7426 0.1882 1.7426 1.3201
No log 5.3659 220 1.7970 0.1555 1.7970 1.3405
No log 5.4146 222 1.6555 0.1667 1.6555 1.2867
No log 5.4634 224 1.4710 0.1031 1.4710 1.2128
No log 5.5122 226 1.3257 0.1795 1.3257 1.1514
No log 5.5610 228 1.2421 0.1795 1.2421 1.1145
No log 5.6098 230 1.2560 0.2000 1.2560 1.1207
No log 5.6585 232 1.3226 0.2367 1.3226 1.1500
No log 5.7073 234 1.4554 0.2436 1.4554 1.2064
No log 5.7561 236 1.5059 0.2658 1.5059 1.2272
No log 5.8049 238 1.4021 0.2405 1.4021 1.1841
No log 5.8537 240 1.2976 0.1670 1.2976 1.1391
No log 5.9024 242 1.2278 0.2100 1.2278 1.1081
No log 5.9512 244 1.2483 0.2650 1.2483 1.1173
No log 6.0 246 1.3409 0.2875 1.3409 1.1580
No log 6.0488 248 1.3887 0.2489 1.3887 1.1784
No log 6.0976 250 1.4015 0.2704 1.4015 1.1839
No log 6.1463 252 1.3010 0.2725 1.3010 1.1406
No log 6.1951 254 1.1759 0.2762 1.1759 1.0844
No log 6.2439 256 1.1902 0.2762 1.1902 1.0909
No log 6.2927 258 1.2748 0.3225 1.2748 1.1291
No log 6.3415 260 1.2879 0.2559 1.2879 1.1349
No log 6.3902 262 1.1826 0.2570 1.1826 1.0875
No log 6.4390 264 1.1583 0.2904 1.1583 1.0762
No log 6.4878 266 1.1708 0.2870 1.1708 1.0821
No log 6.5366 268 1.2614 0.2938 1.2614 1.1231
No log 6.5854 270 1.3780 0.2886 1.3780 1.1739
No log 6.6341 272 1.4299 0.2735 1.4299 1.1958
No log 6.6829 274 1.3857 0.2659 1.3857 1.1772
No log 6.7317 276 1.3043 0.2390 1.3043 1.1421
No log 6.7805 278 1.1305 0.2263 1.1305 1.0633
No log 6.8293 280 1.0686 0.2317 1.0686 1.0337
No log 6.8780 282 1.1172 0.2152 1.1172 1.0570
No log 6.9268 284 1.2526 0.2184 1.2526 1.1192
No log 6.9756 286 1.4296 0.2289 1.4296 1.1957
No log 7.0244 288 1.4402 0.1646 1.4402 1.2001
No log 7.0732 290 1.3385 0.1219 1.3385 1.1569
No log 7.1220 292 1.3203 0.1637 1.3203 1.1490
No log 7.1707 294 1.4094 0.1345 1.4094 1.1872
No log 7.2195 296 1.3899 0.1345 1.3899 1.1790
No log 7.2683 298 1.3109 0.1316 1.3109 1.1449
No log 7.3171 300 1.3471 0.0887 1.3471 1.1606
No log 7.3659 302 1.3688 0.0887 1.3688 1.1700
No log 7.4146 304 1.4116 0.0702 1.4116 1.1881
No log 7.4634 306 1.4376 0.0702 1.4376 1.1990
No log 7.5122 308 1.4261 0.0887 1.4261 1.1942
No log 7.5610 310 1.3489 0.0887 1.3489 1.1614
No log 7.6098 312 1.2595 0.1316 1.2595 1.1223
No log 7.6585 314 1.2574 0.1637 1.2574 1.1213
No log 7.7073 316 1.3882 0.1283 1.3882 1.1782
No log 7.7561 318 1.5390 0.1386 1.5390 1.2406
No log 7.8049 320 1.5306 0.1498 1.5306 1.2372
No log 7.8537 322 1.4436 0.1224 1.4436 1.2015
No log 7.9024 324 1.4143 0.1224 1.4143 1.1892
No log 7.9512 326 1.3174 0.1345 1.3174 1.1478
No log 8.0 328 1.2819 0.1345 1.2819 1.1322
No log 8.0488 330 1.3807 0.1646 1.3807 1.1751
No log 8.0976 332 1.5046 0.1228 1.5046 1.2266
No log 8.1463 334 1.4527 0.0920 1.4527 1.2053
No log 8.1951 336 1.2695 0.1053 1.2695 1.1267
No log 8.2439 338 1.1957 0.1417 1.1957 1.0935
No log 8.2927 340 1.2454 0.0802 1.2454 1.1160
No log 8.3415 342 1.3656 0.0887 1.3656 1.1686
No log 8.3902 344 1.4173 0.0887 1.4173 1.1905
No log 8.4390 346 1.4699 0.1219 1.4699 1.2124
No log 8.4878 348 1.4377 0.0778 1.4377 1.1990
No log 8.5366 350 1.4452 0.2024 1.4452 1.2022
No log 8.5854 352 1.4195 0.2289 1.4195 1.1914
No log 8.6341 354 1.3637 0.2390 1.3637 1.1678
No log 8.6829 356 1.3446 0.2390 1.3446 1.1596
No log 8.7317 358 1.3550 0.1979 1.3550 1.1640
No log 8.7805 360 1.3915 0.1929 1.3915 1.1796
No log 8.8293 362 1.3584 0.1667 1.3584 1.1655
No log 8.8780 364 1.2855 0.1371 1.2855 1.1338
No log 8.9268 366 1.2215 0.1611 1.2215 1.1052
No log 8.9756 368 1.2944 0.2550 1.2944 1.1377
No log 9.0244 370 1.4655 0.2950 1.4655 1.2106
No log 9.0732 372 1.4503 0.3140 1.4503 1.2043
No log 9.1220 374 1.3267 0.3087 1.3267 1.1518
No log 9.1707 376 1.2479 0.1758 1.2479 1.1171
No log 9.2195 378 1.2737 0.1462 1.2737 1.1286
No log 9.2683 380 1.3474 0.0920 1.3474 1.1608
No log 9.3171 382 1.3306 0.1313 1.3306 1.1535
No log 9.3659 384 1.3293 0.1608 1.3293 1.1530
No log 9.4146 386 1.2529 0.1313 1.2529 1.1193
No log 9.4634 388 1.1654 0.1611 1.1654 1.0795
No log 9.5122 390 1.1752 0.1753 1.1752 1.0841
No log 9.5610 392 1.3353 0.3271 1.3353 1.1556
No log 9.6098 394 1.4569 0.3140 1.4569 1.2070
No log 9.6585 396 1.3803 0.3140 1.3803 1.1749
No log 9.7073 398 1.1516 0.2914 1.1516 1.0731
No log 9.7561 400 0.9759 0.2395 0.9759 0.9879
No log 9.8049 402 0.9537 0.2752 0.9537 0.9766
No log 9.8537 404 1.0349 0.2207 1.0349 1.0173
No log 9.9024 406 1.2107 0.1703 1.2107 1.1003
No log 9.9512 408 1.4143 0.1698 1.4143 1.1893
No log 10.0 410 1.4751 0.1461 1.4751 1.2145
No log 10.0488 412 1.3884 0.1188 1.3884 1.1783
No log 10.0976 414 1.3210 0.1076 1.3210 1.1494
No log 10.1463 416 1.2441 0.1247 1.2441 1.1154
No log 10.1951 418 1.2940 0.2097 1.2940 1.1376
No log 10.2439 420 1.4335 0.1516 1.4335 1.1973
No log 10.2927 422 1.5324 0.2251 1.5324 1.2379
No log 10.3415 424 1.4807 0.2863 1.4807 1.2168
No log 10.3902 426 1.4249 0.3277 1.4249 1.1937
No log 10.4390 428 1.2963 0.2807 1.2963 1.1385
No log 10.4878 430 1.2512 0.2559 1.2512 1.1186
No log 10.5366 432 1.3365 0.3500 1.3365 1.1560
No log 10.5854 434 1.4233 0.3512 1.4233 1.1930
No log 10.6341 436 1.4165 0.3405 1.4165 1.1902
No log 10.6829 438 1.3202 0.3309 1.3202 1.1490
No log 10.7317 440 1.3427 0.3523 1.3427 1.1588
No log 10.7805 442 1.4554 0.3269 1.4554 1.2064
No log 10.8293 444 1.4778 0.3157 1.4778 1.2157
No log 10.8780 446 1.3583 0.3047 1.3583 1.1655
No log 10.9268 448 1.3442 0.2201 1.3442 1.1594
No log 10.9756 450 1.3976 0.2024 1.3976 1.1822
No log 11.0244 452 1.4572 0.2024 1.4572 1.2071
No log 11.0732 454 1.3745 0.1219 1.3745 1.1724
No log 11.1220 456 1.2735 0.0887 1.2735 1.1285
No log 11.1707 458 1.2386 0.1479 1.2386 1.1129
No log 11.2195 460 1.1995 0.1685 1.1995 1.0952
No log 11.2683 462 1.1978 0.1795 1.1978 1.0944
No log 11.3171 464 1.2746 0.1703 1.2746 1.1290
No log 11.3659 466 1.2889 0.1989 1.2889 1.1353
No log 11.4146 468 1.3078 0.2781 1.3078 1.1436
No log 11.4634 470 1.2725 0.2589 1.2725 1.1281
No log 11.5122 472 1.2114 0.1989 1.2114 1.1006
No log 11.5610 474 1.2118 0.1989 1.2118 1.1008
No log 11.6098 476 1.2122 0.1989 1.2122 1.1010
No log 11.6585 478 1.2081 0.1989 1.2081 1.0992
No log 11.7073 480 1.1991 0.2141 1.1991 1.0950
No log 11.7561 482 1.3056 0.1796 1.3056 1.1426
No log 11.8049 484 1.3775 0.2632 1.3775 1.1737
No log 11.8537 486 1.3396 0.2105 1.3396 1.1574
No log 11.9024 488 1.2623 0.2317 1.2623 1.1235
No log 11.9512 490 1.2053 0.1944 1.2053 1.0979
No log 12.0 492 1.1735 0.1795 1.1735 1.0833
No log 12.0488 494 1.2347 0.1219 1.2347 1.1112
No log 12.0976 496 1.2687 0.1536 1.2687 1.1264
No log 12.1463 498 1.2768 0.2445 1.2768 1.1300
0.3335 12.1951 500 1.2305 0.1646 1.2305 1.1093
0.3335 12.2439 502 1.2010 0.2291 1.2010 1.0959
0.3335 12.2927 504 1.2120 0.2291 1.2120 1.1009
0.3335 12.3415 506 1.2936 0.2317 1.2936 1.1374
0.3335 12.3902 508 1.2458 0.2291 1.2458 1.1161
0.3335 12.4390 510 1.2043 0.2000 1.2043 1.0974
0.3335 12.4878 512 1.1664 0.2163 1.1664 1.0800
0.3335 12.5366 514 1.1769 0.2163 1.1769 1.0848
0.3335 12.5854 516 1.2791 0.1345 1.2791 1.1310
0.3335 12.6341 518 1.3757 0.0851 1.3757 1.1729
0.3335 12.6829 520 1.3788 0.0851 1.3788 1.1742
0.3335 12.7317 522 1.2792 0.1795 1.2792 1.1310
0.3335 12.7805 524 1.1816 0.1582 1.1816 1.0870
0.3335 12.8293 526 1.2043 0.1582 1.2043 1.0974
0.3335 12.8780 528 1.2470 0.1582 1.2470 1.1167
0.3335 12.9268 530 1.2644 0.1247 1.2644 1.1244
0.3335 12.9756 532 1.2571 0.1247 1.2571 1.1212

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
1
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k14_task2_organization

Finetuned
(4019)
this model