ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k7_task2_organization

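This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified (the auto-generated card records it as "None"). At the last logged evaluation step, it achieves the following results on the evaluation set: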

  • Loss: 1.2810
  • Qwk: 0.2507
  • Mse: 1.2810
  • Rmse: 1.1318
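The reported Loss and Mse are identical (1.2810), and Rmse is the square root of Mse. This suggests, though the card does not state it, that the model was trained as a regressor with a mean-squared-error loss. The block below is a minimal sketch of how the three metrics could be computed, not the author's evaluation code. It assumes integer gold scores and that continuous predictions are rounded before computing Qwk (quadratic weighted kappa). All function names and sample scores are hypothetical.

```python
import math
from collections import Counter


def mse(y_true, y_pred):
    """Mean squared error between gold scores and raw predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


def quadratic_weighted_kappa(y_true, y_pred):
    """Cohen's kappa with quadratic weights over integer labels.

    Weights use the rank of each label among the labels actually observed,
    so only labels present in the inputs are considered.
    """
    labels = sorted(set(y_true) | set(y_pred))
    if len(labels) < 2:
        return 1.0  # a single shared label: treat as perfect agreement
    index = {label: i for i, label in enumerate(labels)}
    n = len(y_true)
    k = len(labels)
    observed = Counter((index[t], index[p]) for t, p in zip(y_true, y_pred))
    true_hist = Counter(index[t] for t in y_true)
    pred_hist = Counter(index[p] for p in y_pred)
    disagreement = 0.0  # weighted observed disagreement
    expected = 0.0      # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2
            disagreement += weight * observed[(i, j)] / n
            expected += weight * true_hist[i] * pred_hist[j] / (n * n)
    return 1.0 - disagreement / expected if expected else 1.0


# Hypothetical gold scores and raw regression outputs, for illustration only.
gold = [1, 2, 3, 4]
raw = [1.4, 2.2, 2.6, 3.1]
rounded = [round(p) for p in raw]
print(mse(gold, raw), rmse(gold, raw), quadratic_weighted_kappa(gold, rounded))
```

As a consistency check on the card itself, `math.sqrt(1.2810)` rounds to 1.1318, matching the reported Rmse.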

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
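With `lr_scheduler_type: linear`, the learning rate decays linearly from its initial value toward zero over the planned run. The logged epoch values imply 35 optimizer steps per epoch (step 70 corresponds to epoch 2.0), so 100 epochs would amount to 3,500 steps. The following sketch works through that arithmetic. It assumes zero warmup steps, since the card lists none, and the constant and function names are hypothetical.

```python
BASE_LR = 2e-05        # learning_rate from the card
STEPS_PER_EPOCH = 35   # inferred from the log: step 70 <-> epoch 2.0
NUM_EPOCHS = 100       # num_epochs from the card
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS
WARMUP_STEPS = 0       # assumption: no warmup is listed on the card


def linear_lr(step):
    """Learning rate at a given optimizer step under linear decay."""
    if step < WARMUP_STEPS:
        # Linear ramp-up during warmup (unused when WARMUP_STEPS is 0).
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


for step in (0, 548, 1750, 3500):
    print(step, linear_lr(step))
```

Under these assumptions, the learning rate at step 548 (the last step in the results table) would still be roughly 1.69e-05, that is, about 84% of its initial value.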

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0571 2 4.7097 0.0010 4.7097 2.1702
No log 0.1143 4 2.6468 0.0159 2.6468 1.6269
No log 0.1714 6 1.8570 0.0062 1.8570 1.3627
No log 0.2286 8 1.7232 -0.0157 1.7232 1.3127
No log 0.2857 10 1.5827 -0.1038 1.5827 1.2581
No log 0.3429 12 1.9735 -0.1442 1.9735 1.4048
No log 0.4 14 2.5891 -0.0827 2.5891 1.6091
No log 0.4571 16 1.9572 -0.0213 1.9572 1.3990
No log 0.5143 18 1.4148 -0.0614 1.4148 1.1894
No log 0.5714 20 1.3637 -0.0963 1.3637 1.1678
No log 0.6286 22 1.3844 -0.0391 1.3844 1.1766
No log 0.6857 24 1.5414 0.0082 1.5414 1.2415
No log 0.7429 26 2.0251 0.0372 2.0251 1.4231
No log 0.8 28 2.2208 0.0971 2.2208 1.4902
No log 0.8571 30 1.9781 0.0629 1.9781 1.4065
No log 0.9143 32 1.7062 0.0363 1.7062 1.3062
No log 0.9714 34 1.4075 0.2162 1.4075 1.1864
No log 1.0286 36 1.1721 0.2068 1.1721 1.0826
No log 1.0857 38 1.3060 0.2005 1.3060 1.1428
No log 1.1429 40 1.4791 0.1822 1.4791 1.2162
No log 1.2 42 1.5483 0.2148 1.5483 1.2443
No log 1.2571 44 1.5664 0.2068 1.5664 1.2515
No log 1.3143 46 1.4369 0.2130 1.4369 1.1987
No log 1.3714 48 2.0296 0.1055 2.0296 1.4246
No log 1.4286 50 2.8479 0.0484 2.8479 1.6876
No log 1.4857 52 3.2002 0.0524 3.2002 1.7889
No log 1.5429 54 2.5801 0.0974 2.5801 1.6063
No log 1.6 56 1.6426 0.2083 1.6426 1.2816
No log 1.6571 58 1.2602 0.2609 1.2602 1.1226
No log 1.7143 60 1.2091 0.2605 1.2091 1.0996
No log 1.7714 62 1.4641 0.1015 1.4641 1.2100
No log 1.8286 64 1.8409 0.1601 1.8409 1.3568
No log 1.8857 66 1.7361 0.1223 1.7361 1.3176
No log 1.9429 68 1.3615 0.1404 1.3615 1.1668
No log 2.0 70 1.1746 0.1793 1.1746 1.0838
No log 2.0571 72 1.2514 0.2574 1.2514 1.1187
No log 2.1143 74 1.2256 0.1505 1.2256 1.1071
No log 2.1714 76 1.1765 0.2628 1.1765 1.0847
No log 2.2286 78 1.1976 0.1927 1.1976 1.0943
No log 2.2857 80 1.3654 0.0600 1.3654 1.1685
No log 2.3429 82 1.6656 0.0145 1.6656 1.2906
No log 2.4 84 1.6774 0.0504 1.6774 1.2951
No log 2.4571 86 1.4112 0.1051 1.4112 1.1880
No log 2.5143 88 1.1920 0.1943 1.1920 1.0918
No log 2.5714 90 1.1496 0.2223 1.1496 1.0722
No log 2.6286 92 1.1834 0.2315 1.1834 1.0879
No log 2.6857 94 1.4289 0.3434 1.4289 1.1954
No log 2.7429 96 2.1735 0.1484 2.1735 1.4743
No log 2.8 98 2.6424 0.1007 2.6424 1.6255
No log 2.8571 100 2.4518 0.1401 2.4518 1.5658
No log 2.9143 102 1.8468 0.2333 1.8468 1.3590
No log 2.9714 104 1.3376 0.2822 1.3376 1.1565
No log 3.0286 106 1.1475 0.3037 1.1475 1.0712
No log 3.0857 108 1.1192 0.2898 1.1192 1.0579
No log 3.1429 110 1.1265 0.2605 1.1265 1.0614
No log 3.2 112 1.3805 0.3025 1.3805 1.1750
No log 3.2571 114 1.5030 0.2333 1.5030 1.2260
No log 3.3143 116 1.4134 0.2634 1.4134 1.1889
No log 3.3714 118 1.2158 0.1996 1.2158 1.1026
No log 3.4286 120 1.1238 0.2417 1.1238 1.0601
No log 3.4857 122 1.1392 0.2071 1.1392 1.0673
No log 3.5429 124 1.1689 0.2315 1.1689 1.0812
No log 3.6 126 1.2588 0.2002 1.2588 1.1220
No log 3.6571 128 1.5260 0.2642 1.5260 1.2353
No log 3.7143 130 1.6884 0.2530 1.6884 1.2994
No log 3.7714 132 1.5823 0.2530 1.5823 1.2579
No log 3.8286 134 1.4635 0.2929 1.4635 1.2098
No log 3.8857 136 1.3999 0.2748 1.3999 1.1832
No log 3.9429 138 1.2560 0.3670 1.2560 1.1207
No log 4.0 140 1.2123 0.4428 1.2123 1.1011
No log 4.0571 142 1.2223 0.4208 1.2223 1.1056
No log 4.1143 144 1.2571 0.3172 1.2571 1.1212
No log 4.1714 146 1.4368 0.3073 1.4368 1.1987
No log 4.2286 148 1.5755 0.2622 1.5755 1.2552
No log 4.2857 150 1.4120 0.3146 1.4120 1.1883
No log 4.3429 152 1.1837 0.3056 1.1837 1.0880
No log 4.4 154 1.1527 0.2898 1.1527 1.0737
No log 4.4571 156 1.1807 0.2824 1.1807 1.0866
No log 4.5143 158 1.3536 0.3176 1.3536 1.1635
No log 4.5714 160 1.4830 0.2071 1.4830 1.2178
No log 4.6286 162 1.5973 0.2174 1.5973 1.2638
No log 4.6857 164 1.6025 0.2174 1.6025 1.2659
No log 4.7429 166 1.5637 0.1862 1.5637 1.2505
No log 4.8 168 1.3880 0.3290 1.3880 1.1781
No log 4.8571 170 1.2286 0.3030 1.2286 1.1084
No log 4.9143 172 1.2694 0.3110 1.2694 1.1267
No log 4.9714 174 1.3363 0.2944 1.3363 1.1560
No log 5.0286 176 1.4568 0.3212 1.4568 1.2070
No log 5.0857 178 1.5144 0.2507 1.5144 1.2306
No log 5.1429 180 1.4218 0.3290 1.4218 1.1924
No log 5.2 182 1.3186 0.3309 1.3186 1.1483
No log 5.2571 184 1.2924 0.3027 1.2924 1.1368
No log 5.3143 186 1.3772 0.2724 1.3772 1.1736
No log 5.3714 188 1.4666 0.2436 1.4666 1.2110
No log 5.4286 190 1.3285 0.3259 1.3285 1.1526
No log 5.4857 192 1.2173 0.3289 1.2173 1.1033
No log 5.5429 194 1.2065 0.2504 1.2065 1.0984
No log 5.6 196 1.2854 0.3202 1.2854 1.1337
No log 5.6571 198 1.4893 0.2550 1.4893 1.2204
No log 5.7143 200 1.4768 0.2331 1.4768 1.2152
No log 5.7714 202 1.3131 0.2919 1.3131 1.1459
No log 5.8286 204 1.1724 0.2620 1.1724 1.0828
No log 5.8857 206 1.1732 0.2466 1.1732 1.0832
No log 5.9429 208 1.2358 0.2710 1.2358 1.1117
No log 6.0 210 1.5020 0.2328 1.5020 1.2256
No log 6.0571 212 1.9030 0.1267 1.9030 1.3795
No log 6.1143 214 2.0616 0.1358 2.0616 1.4358
No log 6.1714 216 2.0585 0.1358 2.0585 1.4348
No log 6.2286 218 1.9369 0.1144 1.9369 1.3917
No log 6.2857 220 1.6217 0.2041 1.6217 1.2734
No log 6.3429 222 1.3462 0.2316 1.3462 1.1602
No log 6.4 224 1.2666 0.2035 1.2666 1.1254
No log 6.4571 226 1.2594 0.2035 1.2594 1.1222
No log 6.5143 228 1.3277 0.2181 1.3277 1.1522
No log 6.5714 230 1.5436 0.2628 1.5436 1.2424
No log 6.6286 232 1.7389 0.1941 1.7389 1.3187
No log 6.6857 234 1.7651 0.2138 1.7651 1.3286
No log 6.7429 236 1.6277 0.1777 1.6277 1.2758
No log 6.8 238 1.5587 0.1645 1.5587 1.2485
No log 6.8571 240 1.4681 0.2162 1.4681 1.2117
No log 6.9143 242 1.5102 0.2123 1.5102 1.2289
No log 6.9714 244 1.6861 0.1658 1.6861 1.2985
No log 7.0286 246 1.6886 0.1592 1.6886 1.2995
No log 7.0857 248 1.6247 0.1776 1.6247 1.2746
No log 7.1429 250 1.5349 0.1520 1.5349 1.2389
No log 7.2 252 1.3713 0.2549 1.3713 1.1710
No log 7.2571 254 1.2708 0.1354 1.2708 1.1273
No log 7.3143 256 1.2447 0.1446 1.2447 1.1156
No log 7.3714 258 1.2945 0.2098 1.2945 1.1377
No log 7.4286 260 1.4649 0.1884 1.4649 1.2103
No log 7.4857 262 1.6312 0.2158 1.6312 1.2772
No log 7.5429 264 1.6353 0.2158 1.6353 1.2788
No log 7.6 266 1.4489 0.2534 1.4489 1.2037
No log 7.6571 268 1.2658 0.1950 1.2658 1.1251
No log 7.7143 270 1.2380 0.1589 1.2380 1.1126
No log 7.7714 272 1.2793 0.1606 1.2793 1.1311
No log 7.8286 274 1.3497 0.1541 1.3497 1.1618
No log 7.8857 276 1.4517 0.2767 1.4517 1.2048
No log 7.9429 278 1.6667 0.2212 1.6667 1.2910
No log 8.0 280 1.9571 0.1920 1.9571 1.3990
No log 8.0571 282 1.9219 0.1691 1.9219 1.3863
No log 8.1143 284 1.6812 0.1795 1.6812 1.2966
No log 8.1714 286 1.4019 0.2004 1.4019 1.1840
No log 8.2286 288 1.2931 0.2311 1.2931 1.1371
No log 8.2857 290 1.2589 0.2311 1.2589 1.1220
No log 8.3429 292 1.2454 0.2311 1.2454 1.1160
No log 8.4 294 1.2920 0.2320 1.2920 1.1367
No log 8.4571 296 1.4462 0.2730 1.4462 1.2026
No log 8.5143 298 1.4866 0.2338 1.4866 1.2192
No log 8.5714 300 1.3973 0.2596 1.3973 1.1821
No log 8.6286 302 1.3348 0.3067 1.3348 1.1553
No log 8.6857 304 1.2426 0.2492 1.2426 1.1147
No log 8.7429 306 1.2577 0.2570 1.2577 1.1215
No log 8.8 308 1.3199 0.2983 1.3199 1.1489
No log 8.8571 310 1.3059 0.2918 1.3059 1.1427
No log 8.9143 312 1.2961 0.2918 1.2961 1.1385
No log 8.9714 314 1.2853 0.2918 1.2853 1.1337
No log 9.0286 316 1.3395 0.3047 1.3395 1.1574
No log 9.0857 318 1.4566 0.3248 1.4566 1.2069
No log 9.1429 320 1.4738 0.3117 1.4738 1.2140
No log 9.2 322 1.2950 0.3166 1.2950 1.1380
No log 9.2571 324 1.2218 0.2529 1.2218 1.1053
No log 9.3143 326 1.2677 0.3303 1.2677 1.1259
No log 9.3714 328 1.2924 0.3503 1.2924 1.1368
No log 9.4286 330 1.3976 0.3390 1.3976 1.1822
No log 9.4857 332 1.3590 0.3482 1.3590 1.1658
No log 9.5429 334 1.2286 0.3227 1.2286 1.1084
No log 9.6 336 1.1979 0.3354 1.1979 1.0945
No log 9.6571 338 1.2856 0.3061 1.2856 1.1338
No log 9.7143 340 1.4721 0.2982 1.4721 1.2133
No log 9.7714 342 1.6947 0.2044 1.6947 1.3018
No log 9.8286 344 1.7064 0.2044 1.7064 1.3063
No log 9.8857 346 1.5562 0.3036 1.5562 1.2475
No log 9.9429 348 1.3506 0.3711 1.3506 1.1621
No log 10.0 350 1.2152 0.3475 1.2152 1.1024
No log 10.0571 352 1.1521 0.3029 1.1521 1.0733
No log 10.1143 354 1.1671 0.3462 1.1671 1.0803
No log 10.1714 356 1.2716 0.3171 1.2716 1.1276
No log 10.2286 358 1.5199 0.3333 1.5199 1.2328
No log 10.2857 360 1.6880 0.3016 1.6880 1.2992
No log 10.3429 362 1.6239 0.3016 1.6239 1.2743
No log 10.4 364 1.3784 0.3470 1.3784 1.1741
No log 10.4571 366 1.1772 0.3462 1.1772 1.0850
No log 10.5143 368 1.1263 0.3074 1.1263 1.0613
No log 10.5714 370 1.1648 0.3672 1.1648 1.0793
No log 10.6286 372 1.2608 0.3827 1.2608 1.1228
No log 10.6857 374 1.3102 0.3630 1.3102 1.1446
No log 10.7429 376 1.3955 0.3326 1.3955 1.1813
No log 10.8 378 1.3242 0.3907 1.3242 1.1507
No log 10.8571 380 1.2879 0.3667 1.2879 1.1348
No log 10.9143 382 1.3061 0.3826 1.3061 1.1428
No log 10.9714 384 1.3893 0.3403 1.3893 1.1787
No log 11.0286 386 1.4303 0.3403 1.4303 1.1959
No log 11.0857 388 1.3944 0.3482 1.3944 1.1808
No log 11.1429 390 1.3164 0.3537 1.3164 1.1473
No log 11.2 392 1.2672 0.3574 1.2672 1.1257
No log 11.2571 394 1.2325 0.3574 1.2325 1.1102
No log 11.3143 396 1.2563 0.3574 1.2563 1.1208
No log 11.3714 398 1.3760 0.3482 1.3760 1.1730
No log 11.4286 400 1.3887 0.3482 1.3887 1.1784
No log 11.4857 402 1.3118 0.3466 1.3118 1.1453
No log 11.5429 404 1.2977 0.3621 1.2977 1.1392
No log 11.6 406 1.3505 0.3658 1.3505 1.1621
No log 11.6571 408 1.4306 0.3275 1.4306 1.1961
No log 11.7143 410 1.3665 0.3503 1.3665 1.1690
No log 11.7714 412 1.3914 0.3159 1.3914 1.1796
No log 11.8286 414 1.3368 0.3402 1.3368 1.1562
No log 11.8857 416 1.2262 0.3070 1.2262 1.1074
No log 11.9429 418 1.1359 0.2964 1.1359 1.0658
No log 12.0 420 1.0921 0.1975 1.0921 1.0450
No log 12.0571 422 1.1093 0.2685 1.1093 1.0532
No log 12.1143 424 1.2199 0.2939 1.2199 1.1045
No log 12.1714 426 1.4247 0.2733 1.4247 1.1936
No log 12.2286 428 1.5341 0.1831 1.5341 1.2386
No log 12.2857 430 1.4768 0.2394 1.4768 1.2152
No log 12.3429 432 1.3601 0.2635 1.3601 1.1662
No log 12.4 434 1.3188 0.2983 1.3188 1.1484
No log 12.4571 436 1.2771 0.2918 1.2771 1.1301
No log 12.5143 438 1.2469 0.3047 1.2469 1.1166
No log 12.5714 440 1.1965 0.3160 1.1965 1.0939
No log 12.6286 442 1.0815 0.3008 1.0815 1.0399
No log 12.6857 444 1.0434 0.3448 1.0434 1.0215
No log 12.7429 446 1.0491 0.3082 1.0491 1.0243
No log 12.8 448 1.1064 0.3008 1.1064 1.0519
No log 12.8571 450 1.2002 0.3092 1.2002 1.0955
No log 12.9143 452 1.1920 0.2962 1.1920 1.0918
No log 12.9714 454 1.1390 0.2762 1.1390 1.0673
No log 13.0286 456 1.0743 0.3467 1.0743 1.0365
No log 13.0857 458 1.0671 0.3782 1.0671 1.0330
No log 13.1429 460 1.1274 0.3731 1.1274 1.0618
No log 13.2 462 1.2867 0.3281 1.2867 1.1343
No log 13.2571 464 1.3872 0.3076 1.3872 1.1778
No log 13.3143 466 1.3690 0.2964 1.3690 1.1700
No log 13.3714 468 1.2507 0.3307 1.2507 1.1184
No log 13.4286 470 1.1460 0.3389 1.1460 1.0705
No log 13.4857 472 1.1191 0.3330 1.1191 1.0579
No log 13.5429 474 1.1396 0.3144 1.1396 1.0675
No log 13.6 476 1.1867 0.3226 1.1867 1.0894
No log 13.6571 478 1.3124 0.3092 1.3124 1.1456
No log 13.7143 480 1.3760 0.3332 1.3760 1.1730
No log 13.7714 482 1.4174 0.2703 1.4174 1.1905
No log 13.8286 484 1.3888 0.3184 1.3888 1.1785
No log 13.8857 486 1.4563 0.2430 1.4563 1.2068
No log 13.9429 488 1.5305 0.2500 1.5305 1.2371
No log 14.0 490 1.5150 0.2353 1.5150 1.2309
No log 14.0571 492 1.3730 0.2201 1.3730 1.1717
No log 14.1143 494 1.1968 0.2565 1.1968 1.0940
No log 14.1714 496 1.1254 0.2374 1.1254 1.0608
No log 14.2286 498 1.1028 0.2796 1.1028 1.0502
0.3794 14.2857 500 1.1434 0.2597 1.1434 1.0693
0.3794 14.3429 502 1.2309 0.2961 1.2309 1.1095
0.3794 14.4 504 1.3412 0.3537 1.3412 1.1581
0.3794 14.4571 506 1.3767 0.2963 1.3767 1.1733
0.3794 14.5143 508 1.4188 0.2944 1.4188 1.1911
0.3794 14.5714 510 1.3905 0.2963 1.3905 1.1792
0.3794 14.6286 512 1.3281 0.3792 1.3281 1.1525
0.3794 14.6857 514 1.2580 0.3339 1.2580 1.1216
0.3794 14.7429 516 1.1796 0.3202 1.1796 1.0861
0.3794 14.8 518 1.1821 0.3184 1.1821 1.0872
0.3794 14.8571 520 1.2260 0.3395 1.2260 1.1072
0.3794 14.9143 522 1.2316 0.2798 1.2316 1.1098
0.3794 14.9714 524 1.2639 0.2922 1.2639 1.1242
0.3794 15.0286 526 1.2783 0.2922 1.2783 1.1306
0.3794 15.0857 528 1.3542 0.3184 1.3542 1.1637
0.3794 15.1429 530 1.4571 0.2447 1.4571 1.2071
0.3794 15.2 532 1.5495 0.2690 1.5495 1.2448
0.3794 15.2571 534 1.5844 0.2253 1.5844 1.2587
0.3794 15.3143 536 1.4717 0.2572 1.4717 1.2131
0.3794 15.3714 538 1.3182 0.2822 1.3182 1.1481
0.3794 15.4286 540 1.1880 0.2486 1.1880 1.0899
0.3794 15.4857 542 1.1522 0.2210 1.1522 1.0734
0.3794 15.5429 544 1.1700 0.2521 1.1700 1.0817
0.3794 15.6 546 1.2225 0.2230 1.2225 1.1057
0.3794 15.6571 548 1.2810 0.2507 1.2810 1.1318
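Each row of the table above has seven fields, where the first field is either a number or the two-word placeholder "No log". The sketch below shows one way to parse such rows and compare checkpoints. It uses three rows copied from the table as a sample; the function and variable names are hypothetical.

```python
# Three rows copied verbatim from the results table.
LOG_EXCERPT = """\
No log 4.0 140 1.2123 0.4428 1.2123 1.1011
No log 12.6857 444 1.0434 0.3448 1.0434 1.0215
0.3794 15.6571 548 1.2810 0.2507 1.2810 1.1318
"""


def parse_row(line):
    """Split one table row into typed fields.

    The last six tokens are always numeric; whatever precedes them is the
    training loss, which may be the two-word placeholder "No log".
    """
    parts = line.split()
    epoch, step, val_loss, qwk, mse, rmse = parts[-6:]
    train_loss_text = " ".join(parts[:-6])
    return {
        "train_loss": None if train_loss_text == "No log" else float(train_loss_text),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


rows = [parse_row(line) for line in LOG_EXCERPT.splitlines() if line.strip()]
best_qwk = max(rows, key=lambda r: r["qwk"])
lowest_loss = min(rows, key=lambda r: r["val_loss"])
print("highest Qwk in sample:", best_qwk["step"], best_qwk["qwk"])
print("lowest validation loss in sample:", lowest_loss["step"], lowest_loss["val_loss"])
```

Among these sample rows, the final logged step (548) has neither the highest Qwk nor the lowest validation loss, so selecting a checkpoint by validation metric rather than by recency may be worth considering.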

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1