ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k10_task5_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.3705
  • Qwk: 0.2126
  • Mse: 1.3705
  • Rmse: 1.1707

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0833 2 4.0352 -0.0008 4.0352 2.0088
No log 0.1667 4 2.5663 -0.0088 2.5663 1.6020
No log 0.25 6 1.7523 0.0435 1.7523 1.3237
No log 0.3333 8 1.1986 0.1255 1.1986 1.0948
No log 0.4167 10 1.0601 0.2663 1.0601 1.0296
No log 0.5 12 1.1813 -0.0075 1.1813 1.0869
No log 0.5833 14 1.2480 0.0021 1.2480 1.1171
No log 0.6667 16 1.1057 0.1062 1.1057 1.0515
No log 0.75 18 1.0794 0.2366 1.0794 1.0389
No log 0.8333 20 1.1134 0.2293 1.1134 1.0552
No log 0.9167 22 1.2043 0.0790 1.2043 1.0974
No log 1.0 24 1.1877 0.1205 1.1877 1.0898
No log 1.0833 26 1.1884 0.1618 1.1884 1.0902
No log 1.1667 28 1.5477 0.0399 1.5477 1.2440
No log 1.25 30 1.5679 0.0 1.5679 1.2522
No log 1.3333 32 1.2898 0.0996 1.2898 1.1357
No log 1.4167 34 1.0327 0.1521 1.0327 1.0162
No log 1.5 36 1.0051 0.1713 1.0051 1.0025
No log 1.5833 38 0.9994 0.1313 0.9994 0.9997
No log 1.6667 40 1.0764 0.0941 1.0764 1.0375
No log 1.75 42 1.0842 0.1537 1.0842 1.0412
No log 1.8333 44 0.9882 0.1685 0.9882 0.9941
No log 1.9167 46 0.9861 0.2746 0.9861 0.9930
No log 2.0 48 0.9850 0.1532 0.9850 0.9925
No log 2.0833 50 1.0110 0.2108 1.0110 1.0055
No log 2.1667 52 1.1221 0.1846 1.1221 1.0593
No log 2.25 54 1.1521 0.2040 1.1521 1.0734
No log 2.3333 56 1.0663 0.2895 1.0663 1.0326
No log 2.4167 58 1.0357 0.2574 1.0357 1.0177
No log 2.5 60 0.9391 0.2251 0.9391 0.9691
No log 2.5833 62 0.9399 0.2276 0.9399 0.9695
No log 2.6667 64 1.1240 0.2149 1.1240 1.0602
No log 2.75 66 1.3706 0.0636 1.3706 1.1707
No log 2.8333 68 1.3196 0.1986 1.3196 1.1488
No log 2.9167 70 1.1582 0.1893 1.1582 1.0762
No log 3.0 72 1.1241 0.2567 1.1241 1.0602
No log 3.0833 74 1.3890 0.2650 1.3890 1.1786
No log 3.1667 76 1.6219 0.1575 1.6219 1.2735
No log 3.25 78 1.3375 0.2729 1.3375 1.1565
No log 3.3333 80 1.0166 0.2636 1.0166 1.0083
No log 3.4167 82 1.0140 0.2505 1.0140 1.0070
No log 3.5 84 1.3260 0.2812 1.3260 1.1515
No log 3.5833 86 1.7867 0.0094 1.7867 1.3367
No log 3.6667 88 1.6855 0.0864 1.6855 1.2982
No log 3.75 90 1.2645 0.2795 1.2645 1.1245
No log 3.8333 92 0.9626 0.2167 0.9626 0.9811
No log 3.9167 94 0.9549 0.2577 0.9549 0.9772
No log 4.0 96 1.0026 0.2238 1.0026 1.0013
No log 4.0833 98 1.1935 0.2203 1.1935 1.0925
No log 4.1667 100 1.4696 -0.0122 1.4696 1.2123
No log 4.25 102 1.4692 0.0033 1.4692 1.2121
No log 4.3333 104 1.2665 0.2477 1.2665 1.1254
No log 4.4167 106 1.2112 0.2409 1.2112 1.1005
No log 4.5 108 1.3189 0.2195 1.3189 1.1484
No log 4.5833 110 1.4691 0.2588 1.4691 1.2120
No log 4.6667 112 1.5950 0.2511 1.5950 1.2629
No log 4.75 114 1.5122 0.2771 1.5122 1.2297
No log 4.8333 116 1.3074 0.2686 1.3074 1.1434
No log 4.9167 118 1.3081 0.2686 1.3081 1.1437
No log 5.0 120 1.5024 0.2292 1.5024 1.2257
No log 5.0833 122 1.6248 0.1902 1.6248 1.2747
No log 5.1667 124 1.5717 0.2342 1.5717 1.2537
No log 5.25 126 1.3994 0.2506 1.3994 1.1830
No log 5.3333 128 1.2788 0.2284 1.2788 1.1309
No log 5.4167 130 1.3171 0.2203 1.3171 1.1477
No log 5.5 132 1.3587 0.2203 1.3587 1.1656
No log 5.5833 134 1.4580 0.2555 1.4580 1.2075
No log 5.6667 136 1.4811 0.2555 1.4811 1.2170
No log 5.75 138 1.3859 0.1814 1.3859 1.1772
No log 5.8333 140 1.1880 0.2284 1.1880 1.0900
No log 5.9167 142 1.1414 0.1911 1.1414 1.0683
No log 6.0 144 1.2514 0.2730 1.2514 1.1187
No log 6.0833 146 1.5817 0.2123 1.5817 1.2577
No log 6.1667 148 1.6955 0.2317 1.6955 1.3021
No log 6.25 150 1.5632 0.2004 1.5632 1.2503
No log 6.3333 152 1.3770 0.2203 1.3770 1.1734
No log 6.4167 154 1.3231 0.2203 1.3231 1.1503
No log 6.5 156 1.2984 0.2143 1.2984 1.1395
No log 6.5833 158 1.3728 0.2455 1.3728 1.1717
No log 6.6667 160 1.3686 0.2506 1.3686 1.1699
No log 6.75 162 1.4668 0.2075 1.4668 1.2111
No log 6.8333 164 1.4634 0.2075 1.4634 1.2097
No log 6.9167 166 1.4068 0.2203 1.4068 1.1861
No log 7.0 168 1.4186 0.1886 1.4186 1.1911
No log 7.0833 170 1.4289 0.1886 1.4289 1.1954
No log 7.1667 172 1.5003 0.2075 1.5003 1.2249
No log 7.25 174 1.5719 0.2690 1.5719 1.2538
No log 7.3333 176 1.5040 0.2731 1.5040 1.2264
No log 7.4167 178 1.3429 0.2772 1.3429 1.1588
No log 7.5 180 1.4584 0.2731 1.4584 1.2076
No log 7.5833 182 1.7301 0.2159 1.7301 1.3153
No log 7.6667 184 1.7354 0.2317 1.7354 1.3173
No log 7.75 186 1.5629 0.2568 1.5629 1.2502
No log 7.8333 188 1.4024 0.2260 1.4024 1.1843
No log 7.9167 190 1.2967 0.1628 1.2967 1.1387
No log 8.0 192 1.2806 0.1628 1.2806 1.1316
No log 8.0833 194 1.3118 0.2260 1.3118 1.1454
No log 8.1667 196 1.3102 0.2555 1.3102 1.1446
No log 8.25 198 1.3176 0.2795 1.3176 1.1479
No log 8.3333 200 1.2959 0.2602 1.2959 1.1384
No log 8.4167 202 1.2153 0.2752 1.2153 1.1024
No log 8.5 204 1.1189 0.3018 1.1189 1.0578
No log 8.5833 206 1.1673 0.2752 1.1673 1.0804
No log 8.6667 208 1.4200 0.2126 1.4200 1.1916
No log 8.75 210 1.6396 0.2004 1.6396 1.2805
No log 8.8333 212 1.6236 0.1703 1.6236 1.2742
No log 8.9167 214 1.5935 0.1703 1.5935 1.2623
No log 9.0 216 1.6719 0.1832 1.6719 1.2930
No log 9.0833 218 1.6333 0.1832 1.6333 1.2780
No log 9.1667 220 1.5344 0.1880 1.5344 1.2387
No log 9.25 222 1.4003 0.2424 1.4003 1.1833
No log 9.3333 224 1.2399 0.2752 1.2399 1.1135
No log 9.4167 226 1.2175 0.1816 1.2175 1.1034
No log 9.5 228 1.2440 0.1407 1.2440 1.1154
No log 9.5833 230 1.3313 0.2474 1.3313 1.1538
No log 9.6667 232 1.3865 0.2342 1.3865 1.1775
No log 9.75 234 1.3563 0.2771 1.3563 1.1646
No log 9.8333 236 1.4861 0.2474 1.4861 1.2191
No log 9.9167 238 1.4028 0.2638 1.4028 1.1844
No log 10.0 240 1.3745 0.2638 1.3745 1.1724
No log 10.0833 242 1.3745 0.2771 1.3745 1.1724
No log 10.1667 244 1.3449 0.2793 1.3449 1.1597
No log 10.25 246 1.4653 0.2752 1.4653 1.2105
No log 10.3333 248 1.4147 0.2474 1.4147 1.1894
No log 10.4167 250 1.2635 0.2474 1.2635 1.1241
No log 10.5 252 1.0776 0.1202 1.0776 1.0381
No log 10.5833 254 1.0122 0.0618 1.0122 1.0061
No log 10.6667 256 1.0699 0.1351 1.0699 1.0344
No log 10.75 258 1.3102 0.2709 1.3102 1.1446
No log 10.8333 260 1.6145 0.2832 1.6145 1.2706
No log 10.9167 262 1.6913 0.2694 1.6913 1.3005
No log 11.0 264 1.5856 0.2522 1.5856 1.2592
No log 11.0833 266 1.4028 0.2474 1.4028 1.1844
No log 11.1667 268 1.2060 0.1552 1.2060 1.0982
No log 11.25 270 1.1804 0.1552 1.1804 1.0865
No log 11.3333 272 1.3121 0.2506 1.3121 1.1454
No log 11.4167 274 1.6497 0.2159 1.6497 1.2844
No log 11.5 276 1.8493 0.1822 1.8493 1.3599
No log 11.5833 278 1.8268 0.1718 1.8268 1.3516
No log 11.6667 280 1.6759 0.2611 1.6759 1.2946
No log 11.75 282 1.4560 0.1744 1.4560 1.2067
No log 11.8333 284 1.2481 0.0833 1.2481 1.1172
No log 11.9167 286 1.1781 0.0445 1.1781 1.0854
No log 12.0 288 1.1913 0.0445 1.1913 1.0915
No log 12.0833 290 1.2429 0.0833 1.2429 1.1148
No log 12.1667 292 1.4023 0.1486 1.4023 1.1842
No log 12.25 294 1.6140 0.2611 1.6140 1.2704
No log 12.3333 296 1.7399 0.2006 1.7399 1.3190
No log 12.4167 298 1.6377 0.2006 1.6377 1.2797
No log 12.5 300 1.3639 0.2690 1.3639 1.1679
No log 12.5833 302 1.1593 0.1552 1.1593 1.0767
No log 12.6667 304 1.1340 0.0987 1.1340 1.0649
No log 12.75 306 1.2413 0.0833 1.2413 1.1141
No log 12.8333 308 1.4031 0.2260 1.4031 1.1845
No log 12.9167 310 1.5234 0.2062 1.5234 1.2342
No log 13.0 312 1.5060 0.2239 1.5060 1.2272
No log 13.0833 314 1.5631 0.2239 1.5631 1.2502
No log 13.1667 316 1.4761 0.2602 1.4761 1.2150
No log 13.25 318 1.3872 0.2143 1.3872 1.1778
No log 13.3333 320 1.4210 0.1886 1.4210 1.1921
No log 13.4167 322 1.4445 0.2203 1.4445 1.2019
No log 13.5 324 1.4133 0.1886 1.4133 1.1888
No log 13.5833 326 1.3446 0.2455 1.3446 1.1596
No log 13.6667 328 1.2757 0.2640 1.2757 1.1295
No log 13.75 330 1.3471 0.2602 1.3471 1.1606
No log 13.8333 332 1.4973 0.2568 1.4973 1.2237
No log 13.9167 334 1.4684 0.2568 1.4684 1.2118
No log 14.0 336 1.2884 0.2367 1.2884 1.1351
No log 14.0833 338 1.2222 0.2752 1.2222 1.1055
No log 14.1667 340 1.3034 0.2602 1.3034 1.1417
No log 14.25 342 1.4226 0.2391 1.4226 1.1927
No log 14.3333 344 1.4090 0.2771 1.4090 1.1870
No log 14.4167 346 1.4275 0.2482 1.4275 1.1948
No log 14.5 348 1.5037 0.2159 1.5037 1.2262
No log 14.5833 350 1.7075 0.1911 1.7075 1.3067
No log 14.6667 352 1.7701 0.2488 1.7701 1.3305
No log 14.75 354 1.6621 0.2568 1.6621 1.2892
No log 14.8333 356 1.5319 0.2004 1.5319 1.2377
No log 14.9167 358 1.4287 0.1142 1.4287 1.1953
No log 15.0 360 1.4552 0.1228 1.4552 1.2063
No log 15.0833 362 1.5100 0.2752 1.5100 1.2288
No log 15.1667 364 1.5936 0.2568 1.5936 1.2624
No log 15.25 366 1.7225 0.2832 1.7225 1.3125
No log 15.3333 368 1.6454 0.2752 1.6454 1.2827
No log 15.4167 370 1.4984 0.1744 1.4984 1.2241
No log 15.5 372 1.3245 0.0781 1.3245 1.1509
No log 15.5833 374 1.2204 0.0401 1.2204 1.1047
No log 15.6667 376 1.2190 0.1816 1.2190 1.1041
No log 15.75 378 1.2737 0.2372 1.2737 1.1286
No log 15.8333 380 1.4724 0.2372 1.4724 1.2134
No log 15.9167 382 1.6224 0.2474 1.6224 1.2738
No log 16.0 384 1.6983 0.2474 1.6983 1.3032
No log 16.0833 386 1.6882 0.2292 1.6882 1.2993
No log 16.1667 388 1.6167 0.2184 1.6167 1.2715
No log 16.25 390 1.5911 0.1727 1.5911 1.2614
No log 16.3333 392 1.6012 0.1413 1.6012 1.2654
No log 16.4167 394 1.6018 0.1413 1.6018 1.2656
No log 16.5 396 1.5745 0.1413 1.5745 1.2548
No log 16.5833 398 1.5044 0.0380 1.5044 1.2265
No log 16.6667 400 1.4273 0.2126 1.4273 1.1947
No log 16.75 402 1.3733 0.2709 1.3733 1.1719
No log 16.8333 404 1.3338 0.2474 1.3338 1.1549
No log 16.9167 406 1.4326 0.2793 1.4326 1.1969
No log 17.0 408 1.4598 0.3429 1.4598 1.2082
No log 17.0833 410 1.3329 0.2793 1.3329 1.1545
No log 17.1667 412 1.2201 0.2795 1.2201 1.1046
No log 17.25 414 1.2959 0.3107 1.2959 1.1384
No log 17.3333 416 1.3171 0.2709 1.3171 1.1477
No log 17.4167 418 1.3296 0.2126 1.3296 1.1531
No log 17.5 420 1.4023 0.2065 1.4023 1.1842
No log 17.5833 422 1.4626 0.2752 1.4626 1.2094
No log 17.6667 424 1.4726 0.2752 1.4726 1.2135
No log 17.75 426 1.4049 0.2424 1.4049 1.1853
No log 17.8333 428 1.2632 0.2203 1.2632 1.1239
No log 17.9167 430 1.2019 0.2592 1.2019 1.0963
No log 18.0 432 1.2726 0.2506 1.2726 1.1281
No log 18.0833 434 1.4532 0.2752 1.4532 1.2055
No log 18.1667 436 1.5114 0.2832 1.5114 1.2294
No log 18.25 438 1.5194 0.2752 1.5194 1.2326
No log 18.3333 440 1.3969 0.2065 1.3969 1.1819
No log 18.4167 442 1.2172 0.0781 1.2172 1.1033
No log 18.5 444 1.1721 0.1202 1.1721 1.0826
No log 18.5833 446 1.2653 0.1744 1.2653 1.1249
No log 18.6667 448 1.4407 0.2944 1.4407 1.2003
No log 18.75 450 1.5317 0.2869 1.5317 1.2376
No log 18.8333 452 1.5388 0.2482 1.5388 1.2405
No log 18.9167 454 1.5500 0.2832 1.5500 1.2450
No log 19.0 456 1.5220 0.2665 1.5220 1.2337
No log 19.0833 458 1.4073 0.1142 1.4073 1.1863
No log 19.1667 460 1.3581 0.0401 1.3581 1.1654
No log 19.25 462 1.3209 0.0401 1.3209 1.1493
No log 19.3333 464 1.3631 0.0401 1.3631 1.1675
No log 19.4167 466 1.4320 0.2065 1.4320 1.1966
No log 19.5 468 1.4763 0.2372 1.4763 1.2150
No log 19.5833 470 1.5171 0.2944 1.5171 1.2317
No log 19.6667 472 1.5296 0.2372 1.5296 1.2368
No log 19.75 474 1.4949 0.2065 1.4949 1.2227
No log 19.8333 476 1.3820 0.0401 1.3820 1.1756
No log 19.9167 478 1.2766 0.0401 1.2766 1.1299
No log 20.0 480 1.2088 0.0401 1.2088 1.0994
No log 20.0833 482 1.1668 0.0833 1.1668 1.0802
No log 20.1667 484 1.1932 0.0833 1.1932 1.0923
No log 20.25 486 1.3105 0.1744 1.3105 1.1448
No log 20.3333 488 1.5010 0.2709 1.5010 1.2252
No log 20.4167 490 1.5472 0.2709 1.5472 1.2438
No log 20.5 492 1.5274 0.2709 1.5274 1.2359
No log 20.5833 494 1.4752 0.2709 1.4752 1.2146
No log 20.6667 496 1.4060 0.2424 1.4060 1.1857
No log 20.75 498 1.4425 0.2424 1.4425 1.2011
0.2652 20.8333 500 1.4214 0.2126 1.4214 1.1922
0.2652 20.9167 502 1.3497 0.1486 1.3497 1.1618
0.2652 21.0 504 1.2838 0.1407 1.2838 1.1331
0.2652 21.0833 506 1.2817 0.2065 1.2817 1.1321
0.2652 21.1667 508 1.3237 0.2424 1.3237 1.1505
0.2652 21.25 510 1.3996 0.2709 1.3996 1.1831
0.2652 21.3333 512 1.4700 0.2709 1.4700 1.2124
0.2652 21.4167 514 1.4987 0.2982 1.4987 1.2242
0.2652 21.5 516 1.4594 0.2982 1.4594 1.2081
0.2652 21.5833 518 1.3836 0.2709 1.3836 1.1763
0.2652 21.6667 520 1.3495 0.2424 1.3495 1.1617
0.2652 21.75 522 1.3638 0.2424 1.3638 1.1678
0.2652 21.8333 524 1.3639 0.2424 1.3639 1.1679
0.2652 21.9167 526 1.3281 0.2424 1.3281 1.1524
0.2652 22.0 528 1.2734 0.2424 1.2734 1.1285
0.2652 22.0833 530 1.2183 0.2506 1.2183 1.1038
0.2652 22.1667 532 1.2544 0.2709 1.2544 1.1200
0.2652 22.25 534 1.3320 0.2709 1.3320 1.1541
0.2652 22.3333 536 1.4126 0.2709 1.4126 1.1885
0.2652 22.4167 538 1.4914 0.3052 1.4914 1.2212
0.2652 22.5 540 1.4215 0.3018 1.4215 1.1923
0.2652 22.5833 542 1.3819 0.3365 1.3819 1.1755
0.2652 22.6667 544 1.3734 0.2982 1.3734 1.1719
0.2652 22.75 546 1.3733 0.2709 1.3733 1.1719
0.2652 22.8333 548 1.3477 0.2424 1.3477 1.1609
0.2652 22.9167 550 1.3121 0.2065 1.3121 1.1455
0.2652 23.0 552 1.2782 0.1407 1.2782 1.1306
0.2652 23.0833 554 1.2868 0.1407 1.2868 1.1344
0.2652 23.1667 556 1.3193 0.1407 1.3193 1.1486
0.2652 23.25 558 1.3705 0.2126 1.3705 1.1707

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k10_task5_organization

Finetuned
(4019)
this model