ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k4_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1286
  • Qwk: 0.3009
  • Mse: 1.1286
  • Rmse: 1.0623

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1 2 4.7070 -0.0104 4.7070 2.1696
No log 0.2 4 2.7250 -0.0600 2.7250 1.6508
No log 0.3 6 1.6921 0.0372 1.6921 1.3008
No log 0.4 8 1.3609 0.0598 1.3609 1.1666
No log 0.5 10 1.5130 -0.0264 1.5130 1.2301
No log 0.6 12 1.4081 -0.0729 1.4081 1.1867
No log 0.7 14 1.4277 0.0346 1.4277 1.1949
No log 0.8 16 1.3100 0.0847 1.3100 1.1446
No log 0.9 18 1.4038 -0.0199 1.4038 1.1848
No log 1.0 20 1.5912 0.0449 1.5912 1.2614
No log 1.1 22 1.4255 0.0894 1.4255 1.1939
No log 1.2 24 1.2644 0.1045 1.2644 1.1245
No log 1.3 26 1.2800 0.1999 1.2800 1.1314
No log 1.4 28 1.2336 0.1791 1.2336 1.1107
No log 1.5 30 1.2574 0.1448 1.2574 1.1213
No log 1.6 32 1.6318 0.1417 1.6318 1.2774
No log 1.7 34 1.6776 0.1428 1.6776 1.2952
No log 1.8 36 1.3900 0.1316 1.3900 1.1790
No log 1.9 38 1.3851 0.1877 1.3851 1.1769
No log 2.0 40 1.8306 0.1758 1.8306 1.3530
No log 2.1 42 2.1135 0.0962 2.1135 1.4538
No log 2.2 44 1.7009 0.1630 1.7009 1.3042
No log 2.3 46 1.2800 0.1658 1.2800 1.1314
No log 2.4 48 1.1586 0.2068 1.1586 1.0764
No log 2.5 50 1.1425 0.2010 1.1425 1.0689
No log 2.6 52 1.1164 0.2526 1.1164 1.0566
No log 2.7 54 1.1200 0.2424 1.1200 1.0583
No log 2.8 56 1.2036 0.1999 1.2036 1.0971
No log 2.9 58 1.2836 0.1912 1.2836 1.1330
No log 3.0 60 1.5784 0.1831 1.5784 1.2563
No log 3.1 62 1.6965 0.2115 1.6965 1.3025
No log 3.2 64 1.4220 0.2690 1.4220 1.1925
No log 3.3 66 1.3315 0.2007 1.3315 1.1539
No log 3.4 68 1.4671 0.1993 1.4671 1.2113
No log 3.5 70 1.5095 0.2692 1.5095 1.2286
No log 3.6 72 1.3021 0.1971 1.3021 1.1411
No log 3.7 74 1.1576 0.3441 1.1576 1.0759
No log 3.8 76 1.1513 0.3421 1.1513 1.0730
No log 3.9 78 1.2164 0.2797 1.2164 1.1029
No log 4.0 80 1.2019 0.3569 1.2019 1.0963
No log 4.1 82 1.1087 0.4126 1.1087 1.0530
No log 4.2 84 1.1216 0.3953 1.1216 1.0591
No log 4.3 86 1.1516 0.3918 1.1516 1.0731
No log 4.4 88 1.1588 0.3976 1.1588 1.0765
No log 4.5 90 1.2706 0.4110 1.2706 1.1272
No log 4.6 92 1.5710 0.2720 1.5710 1.2534
No log 4.7 94 1.7474 0.2755 1.7474 1.3219
No log 4.8 96 1.4898 0.2688 1.4898 1.2206
No log 4.9 98 1.2882 0.3661 1.2882 1.1350
No log 5.0 100 1.3701 0.3023 1.3701 1.1705
No log 5.1 102 1.4070 0.3023 1.4070 1.1862
No log 5.2 104 1.3923 0.3023 1.3923 1.1800
No log 5.3 106 1.4097 0.3296 1.4097 1.1873
No log 5.4 108 1.2694 0.3654 1.2694 1.1267
No log 5.5 110 1.1916 0.3343 1.1916 1.0916
No log 5.6 112 1.1884 0.3343 1.1884 1.0902
No log 5.7 114 1.2572 0.3496 1.2572 1.1212
No log 5.8 116 1.6129 0.2182 1.6129 1.2700
No log 5.9 118 1.5846 0.2182 1.5846 1.2588
No log 6.0 120 1.3788 0.3289 1.3788 1.1742
No log 6.1 122 1.3486 0.3287 1.3486 1.1613
No log 6.2 124 1.4975 0.2182 1.4975 1.2237
No log 6.3 126 1.5875 0.2182 1.5875 1.2600
No log 6.4 128 1.5451 0.2111 1.5451 1.2430
No log 6.5 130 1.4329 0.3166 1.4329 1.1970
No log 6.6 132 1.2409 0.3385 1.2409 1.1140
No log 6.7 134 1.2183 0.3568 1.2183 1.1038
No log 6.8 136 1.3668 0.3454 1.3668 1.1691
No log 6.9 138 1.5080 0.3103 1.5080 1.2280
No log 7.0 140 1.3233 0.4016 1.3233 1.1503
No log 7.1 142 1.1917 0.4177 1.1917 1.0917
No log 7.2 144 1.1788 0.3411 1.1788 1.0857
No log 7.3 146 1.2417 0.3797 1.2417 1.1143
No log 7.4 148 1.6276 0.2375 1.6276 1.2758
No log 7.5 150 1.7992 0.2202 1.7992 1.3413
No log 7.6 152 1.4794 0.2896 1.4794 1.2163
No log 7.7 154 1.1690 0.3541 1.1690 1.0812
No log 7.8 156 1.1465 0.2907 1.1465 1.0707
No log 7.9 158 1.1459 0.3227 1.1459 1.0704
No log 8.0 160 1.2939 0.4078 1.2939 1.1375
No log 8.1 162 1.3937 0.3816 1.3937 1.1806
No log 8.2 164 1.2714 0.4232 1.2714 1.1276
No log 8.3 166 1.2130 0.4086 1.2130 1.1014
No log 8.4 168 1.2025 0.3300 1.2025 1.0966
No log 8.5 170 1.1659 0.3328 1.1659 1.0798
No log 8.6 172 1.1623 0.3692 1.1623 1.0781
No log 8.7 174 1.2963 0.3197 1.2963 1.1386
No log 8.8 176 1.3283 0.3719 1.3283 1.1525
No log 8.9 178 1.1723 0.2069 1.1723 1.0827
No log 9.0 180 1.0998 0.1914 1.0998 1.0487
No log 9.1 182 1.1027 0.3083 1.1027 1.0501
No log 9.2 184 1.0943 0.2188 1.0943 1.0461
No log 9.3 186 1.2427 0.3316 1.2427 1.1147
No log 9.4 188 1.2675 0.3119 1.2675 1.1258
No log 9.5 190 1.2132 0.3898 1.2132 1.1015
No log 9.6 192 1.3180 0.3298 1.3180 1.1480
No log 9.7 194 1.3333 0.3097 1.3333 1.1547
No log 9.8 196 1.1545 0.4157 1.1545 1.0745
No log 9.9 198 1.1131 0.3707 1.1131 1.0551
No log 10.0 200 1.1343 0.4211 1.1343 1.0650
No log 10.1 202 1.2042 0.3545 1.2042 1.0974
No log 10.2 204 1.3913 0.3234 1.3913 1.1795
No log 10.3 206 1.3159 0.3773 1.3159 1.1471
No log 10.4 208 1.1880 0.4173 1.1880 1.0899
No log 10.5 210 1.1791 0.4327 1.1791 1.0859
No log 10.6 212 1.2161 0.4024 1.2161 1.1028
No log 10.7 214 1.1880 0.4065 1.1880 1.0900
No log 10.8 216 1.1862 0.3982 1.1862 1.0891
No log 10.9 218 1.1086 0.2546 1.1086 1.0529
No log 11.0 220 1.1074 0.2546 1.1074 1.0523
No log 11.1 222 1.1120 0.2546 1.1120 1.0545
No log 11.2 224 1.1840 0.3459 1.1840 1.0881
No log 11.3 226 1.2379 0.3259 1.2379 1.1126
No log 11.4 228 1.3876 0.3446 1.3876 1.1780
No log 11.5 230 1.7278 0.2605 1.7278 1.3145
No log 11.6 232 1.7931 0.2424 1.7931 1.3390
No log 11.7 234 1.4954 0.2953 1.4954 1.2229
No log 11.8 236 1.2833 0.4039 1.2833 1.1328
No log 11.9 238 1.2434 0.3783 1.2434 1.1151
No log 12.0 240 1.2134 0.3935 1.2134 1.1015
No log 12.1 242 1.2447 0.4067 1.2447 1.1157
No log 12.2 244 1.1905 0.3777 1.1905 1.0911
No log 12.3 246 1.1324 0.3059 1.1324 1.0641
No log 12.4 248 1.1131 0.3016 1.1131 1.0550
No log 12.5 250 1.1105 0.3407 1.1105 1.0538
No log 12.6 252 1.1378 0.3407 1.1378 1.0667
No log 12.7 254 1.1775 0.3717 1.1775 1.0851
No log 12.8 256 1.3062 0.3520 1.3062 1.1429
No log 12.9 258 1.4926 0.3003 1.4926 1.2217
No log 13.0 260 1.4327 0.2704 1.4327 1.1969
No log 13.1 262 1.2681 0.3889 1.2681 1.1261
No log 13.2 264 1.1832 0.3424 1.1832 1.0878
No log 13.3 266 1.2144 0.3542 1.2144 1.1020
No log 13.4 268 1.2435 0.3859 1.2435 1.1151
No log 13.5 270 1.1833 0.3542 1.1833 1.0878
No log 13.6 272 1.1084 0.3321 1.1084 1.0528
No log 13.7 274 1.1024 0.3666 1.1024 1.0499
No log 13.8 276 1.1312 0.3971 1.1312 1.0636
No log 13.9 278 1.1884 0.3670 1.1884 1.0901
No log 14.0 280 1.1821 0.3670 1.1821 1.0873
No log 14.1 282 1.1273 0.3888 1.1273 1.0617
No log 14.2 284 1.1278 0.3819 1.1278 1.0620
No log 14.3 286 1.1952 0.3605 1.1952 1.0933
No log 14.4 288 1.1891 0.2943 1.1891 1.0905
No log 14.5 290 1.1349 0.3363 1.1349 1.0653
No log 14.6 292 1.0674 0.3371 1.0674 1.0331
No log 14.7 294 1.0188 0.2776 1.0188 1.0093
No log 14.8 296 1.0220 0.3103 1.0220 1.0110
No log 14.9 298 1.0865 0.3987 1.0865 1.0423
No log 15.0 300 1.1619 0.4059 1.1619 1.0779
No log 15.1 302 1.3806 0.3086 1.3806 1.1750
No log 15.2 304 1.4401 0.3051 1.4401 1.2000
No log 15.3 306 1.2873 0.3515 1.2873 1.1346
No log 15.4 308 1.0942 0.3243 1.0942 1.0460
No log 15.5 310 1.0846 0.3750 1.0846 1.0414
No log 15.6 312 1.1004 0.2995 1.1004 1.0490
No log 15.7 314 1.1402 0.2371 1.1402 1.0678
No log 15.8 316 1.3987 0.3133 1.3987 1.1827
No log 15.9 318 1.9210 0.1872 1.9210 1.3860
No log 16.0 320 2.2609 0.1599 2.2609 1.5036
No log 16.1 322 2.1798 0.1890 2.1798 1.4764
No log 16.2 324 1.7556 0.3136 1.7556 1.3250
No log 16.3 326 1.4814 0.2838 1.4814 1.2171
No log 16.4 328 1.3448 0.3627 1.3448 1.1596
No log 16.5 330 1.2827 0.3893 1.2827 1.1325
No log 16.6 332 1.2708 0.3738 1.2708 1.1273
No log 16.7 334 1.2910 0.3147 1.2910 1.1362
No log 16.8 336 1.2454 0.3172 1.2454 1.1160
No log 16.9 338 1.1974 0.3176 1.1974 1.0943
No log 17.0 340 1.1886 0.3434 1.1886 1.0902
No log 17.1 342 1.2232 0.3389 1.2232 1.1060
No log 17.2 344 1.2466 0.3210 1.2466 1.1165
No log 17.3 346 1.1825 0.3434 1.1825 1.0874
No log 17.4 348 1.1546 0.3359 1.1546 1.0745
No log 17.5 350 1.1751 0.3927 1.1751 1.0840
No log 17.6 352 1.2694 0.3217 1.2694 1.1267
No log 17.7 354 1.3849 0.2967 1.3849 1.1768
No log 17.8 356 1.3166 0.3304 1.3166 1.1475
No log 17.9 358 1.2349 0.3359 1.2349 1.1113
No log 18.0 360 1.1796 0.3483 1.1796 1.0861
No log 18.1 362 1.1655 0.3368 1.1655 1.0796
No log 18.2 364 1.1460 0.3123 1.1460 1.0705
No log 18.3 366 1.1259 0.2663 1.1259 1.0611
No log 18.4 368 1.1268 0.2411 1.1268 1.0615
No log 18.5 370 1.1525 0.3218 1.1525 1.0735
No log 18.6 372 1.2150 0.3531 1.2150 1.1023
No log 18.7 374 1.3686 0.3284 1.3686 1.1699
No log 18.8 376 1.4507 0.3607 1.4507 1.2045
No log 18.9 378 1.4167 0.3391 1.4167 1.1902
No log 19.0 380 1.4581 0.3537 1.4581 1.2075
No log 19.1 382 1.4795 0.3638 1.4795 1.2163
No log 19.2 384 1.4483 0.3423 1.4483 1.2035
No log 19.3 386 1.3297 0.3041 1.3297 1.1531
No log 19.4 388 1.2193 0.3728 1.2193 1.1042
No log 19.5 390 1.1983 0.3429 1.1983 1.0947
No log 19.6 392 1.1887 0.2926 1.1887 1.0903
No log 19.7 394 1.2168 0.3694 1.2168 1.1031
No log 19.8 396 1.2818 0.3466 1.2818 1.1322
No log 19.9 398 1.2588 0.4290 1.2588 1.1220
No log 20.0 400 1.1926 0.3421 1.1926 1.0921
No log 20.1 402 1.1744 0.3816 1.1744 1.0837
No log 20.2 404 1.1878 0.3816 1.1878 1.0898
No log 20.3 406 1.2807 0.4186 1.2807 1.1317
No log 20.4 408 1.4123 0.2550 1.4123 1.1884
No log 20.5 410 1.4231 0.2350 1.4231 1.1929
No log 20.6 412 1.3053 0.3956 1.3053 1.1425
No log 20.7 414 1.2036 0.3378 1.2036 1.0971
No log 20.8 416 1.1809 0.3338 1.1809 1.0867
No log 20.9 418 1.1668 0.3568 1.1668 1.0802
No log 21.0 420 1.2005 0.3682 1.2005 1.0957
No log 21.1 422 1.1701 0.3506 1.1701 1.0817
No log 21.2 424 1.1757 0.3418 1.1757 1.0843
No log 21.3 426 1.1591 0.3768 1.1591 1.0766
No log 21.4 428 1.1055 0.4007 1.1055 1.0514
No log 21.5 430 1.0700 0.3595 1.0700 1.0344
No log 21.6 432 1.0745 0.3595 1.0745 1.0366
No log 21.7 434 1.0881 0.3620 1.0881 1.0431
No log 21.8 436 1.0972 0.4007 1.0972 1.0475
No log 21.9 438 1.0737 0.3368 1.0737 1.0362
No log 22.0 440 1.0703 0.3099 1.0703 1.0345
No log 22.1 442 1.0648 0.3099 1.0648 1.0319
No log 22.2 444 1.0365 0.2989 1.0365 1.0181
No log 22.3 446 1.0314 0.2680 1.0314 1.0156
No log 22.4 448 1.0506 0.3968 1.0506 1.0250
No log 22.5 450 1.1159 0.4512 1.1159 1.0564
No log 22.6 452 1.0899 0.4481 1.0899 1.0440
No log 22.7 454 1.0333 0.3601 1.0333 1.0165
No log 22.8 456 1.0117 0.3536 1.0117 1.0058
No log 22.9 458 1.0002 0.3897 1.0002 1.0001
No log 23.0 460 1.0130 0.3590 1.0130 1.0065
No log 23.1 462 1.0246 0.3888 1.0246 1.0122
No log 23.2 464 1.0304 0.3888 1.0304 1.0151
No log 23.3 466 1.0286 0.3199 1.0286 1.0142
No log 23.4 468 1.0401 0.4228 1.0401 1.0199
No log 23.5 470 1.0628 0.3711 1.0628 1.0309
No log 23.6 472 1.1045 0.3346 1.1045 1.0510
No log 23.7 474 1.1210 0.3672 1.1210 1.0588
No log 23.8 476 1.0804 0.2534 1.0804 1.0394
No log 23.9 478 1.0546 0.2966 1.0546 1.0269
No log 24.0 480 1.0392 0.3208 1.0392 1.0194
No log 24.1 482 1.0343 0.3441 1.0343 1.0170
No log 24.2 484 1.0168 0.4712 1.0168 1.0083
No log 24.3 486 0.9884 0.4435 0.9884 0.9942
No log 24.4 488 0.9803 0.4250 0.9803 0.9901
No log 24.5 490 0.9929 0.4005 0.9929 0.9965
No log 24.6 492 0.9818 0.4430 0.9818 0.9909
No log 24.7 494 1.0011 0.3834 1.0011 1.0006
No log 24.8 496 1.0563 0.3819 1.0563 1.0278
No log 24.9 498 1.0573 0.3819 1.0573 1.0283
0.3133 25.0 500 1.0040 0.3803 1.0040 1.0020
0.3133 25.1 502 1.0091 0.3745 1.0091 1.0045
0.3133 25.2 504 1.0203 0.3601 1.0203 1.0101
0.3133 25.3 506 1.0725 0.3740 1.0725 1.0356
0.3133 25.4 508 1.1520 0.4106 1.1520 1.0733
0.3133 25.5 510 1.2008 0.4590 1.2008 1.0958
0.3133 25.6 512 1.2021 0.4370 1.2021 1.0964
0.3133 25.7 514 1.1371 0.4307 1.1371 1.0664
0.3133 25.8 516 1.0849 0.4013 1.0849 1.0416
0.3133 25.9 518 1.0759 0.3677 1.0759 1.0373
0.3133 26.0 520 1.0873 0.3753 1.0873 1.0427
0.3133 26.1 522 1.0814 0.3328 1.0814 1.0399
0.3133 26.2 524 1.0768 0.2949 1.0768 1.0377
0.3133 26.3 526 1.1286 0.3009 1.1286 1.0623

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k4_task2_organization

Finetuned
(4032)
this model