ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run3_AugV5_k7_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1188
  • Qwk: 0.1458
  • Mse: 1.1188
  • Rmse: 1.0577

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.08 2 4.6972 0.0010 4.6972 2.1673
No log 0.16 4 2.6343 -0.0124 2.6343 1.6231
No log 0.24 6 1.9103 -0.0303 1.9103 1.3822
No log 0.32 8 2.1420 -0.0303 2.1420 1.4636
No log 0.4 10 2.0145 -0.0588 2.0145 1.4193
No log 0.48 12 1.4836 0.0393 1.4836 1.2180
No log 0.56 14 1.3687 0.0189 1.3687 1.1699
No log 0.64 16 1.6504 0.0227 1.6504 1.2847
No log 0.72 18 1.8336 0.0724 1.8336 1.3541
No log 0.8 20 1.4719 0.0 1.4719 1.2132
No log 0.88 22 1.3446 0.0317 1.3446 1.1596
No log 0.96 24 1.2821 0.0169 1.2821 1.1323
No log 1.04 26 1.2512 0.0662 1.2512 1.1186
No log 1.12 28 1.3062 0.1166 1.3062 1.1429
No log 1.2 30 1.2283 0.1882 1.2283 1.1083
No log 1.28 32 1.1716 0.2014 1.1716 1.0824
No log 1.3600 34 1.1546 0.2203 1.1546 1.0745
No log 1.44 36 1.2199 0.1959 1.2199 1.1045
No log 1.52 38 1.4011 0.0776 1.4011 1.1837
No log 1.6 40 1.5341 -0.0284 1.5341 1.2386
No log 1.6800 42 1.4434 0.0362 1.4434 1.2014
No log 1.76 44 1.3224 0.1587 1.3224 1.1500
No log 1.8400 46 1.4118 0.1495 1.4118 1.1882
No log 1.92 48 1.5515 0.0562 1.5515 1.2456
No log 2.0 50 1.6205 0.0477 1.6205 1.2730
No log 2.08 52 1.4590 0.0750 1.4590 1.2079
No log 2.16 54 1.2360 0.1865 1.2360 1.1117
No log 2.24 56 1.1530 0.2589 1.1530 1.0738
No log 2.32 58 1.1086 0.3487 1.1086 1.0529
No log 2.4 60 1.0171 0.3693 1.0171 1.0085
No log 2.48 62 0.9918 0.3644 0.9918 0.9959
No log 2.56 64 1.5648 0.2276 1.5648 1.2509
No log 2.64 66 2.2232 0.1508 2.2232 1.4910
No log 2.7200 68 2.5094 0.0957 2.5094 1.5841
No log 2.8 70 2.2929 0.1458 2.2929 1.5142
No log 2.88 72 1.8145 0.1393 1.8145 1.3470
No log 2.96 74 1.6290 0.26 1.6290 1.2763
No log 3.04 76 1.5003 0.0999 1.5003 1.2249
No log 3.12 78 1.4705 0.0667 1.4705 1.2126
No log 3.2 80 1.2793 0.1288 1.2793 1.1311
No log 3.2800 82 1.1021 0.3189 1.1021 1.0498
No log 3.36 84 1.0745 0.3195 1.0745 1.0366
No log 3.44 86 1.0577 0.3544 1.0577 1.0285
No log 3.52 88 1.0840 0.3385 1.0840 1.0411
No log 3.6 90 1.2260 0.1495 1.2260 1.1073
No log 3.68 92 1.4033 0.1604 1.4033 1.1846
No log 3.76 94 1.4678 0.1708 1.4678 1.2115
No log 3.84 96 1.6422 0.2005 1.6422 1.2815
No log 3.92 98 1.5920 0.1827 1.5920 1.2617
No log 4.0 100 1.4288 0.1585 1.4288 1.1953
No log 4.08 102 1.2982 0.1718 1.2982 1.1394
No log 4.16 104 1.5148 0.1955 1.5148 1.2308
No log 4.24 106 1.6718 0.1393 1.6718 1.2930
No log 4.32 108 1.7025 0.0315 1.7025 1.3048
No log 4.4 110 1.3943 0.1143 1.3943 1.1808
No log 4.48 112 1.0648 0.3590 1.0648 1.0319
No log 4.5600 114 0.9835 0.3382 0.9835 0.9917
No log 4.64 116 0.9899 0.3382 0.9899 0.9949
No log 4.72 118 1.0936 0.2815 1.0936 1.0458
No log 4.8 120 1.4935 0.2191 1.4935 1.2221
No log 4.88 122 1.7223 0.1639 1.7223 1.3124
No log 4.96 124 1.5621 0.1731 1.5621 1.2498
No log 5.04 126 1.4272 0.1707 1.4272 1.1946
No log 5.12 128 1.2120 0.2316 1.2120 1.1009
No log 5.2 130 1.0820 0.2913 1.0820 1.0402
No log 5.28 132 1.1376 0.3365 1.1376 1.0666
No log 5.36 134 1.3241 0.2716 1.3241 1.1507
No log 5.44 136 1.2779 0.3133 1.2779 1.1305
No log 5.52 138 1.0949 0.2986 1.0949 1.0464
No log 5.6 140 0.9852 0.2835 0.9852 0.9926
No log 5.68 142 0.9716 0.2835 0.9716 0.9857
No log 5.76 144 1.0641 0.3149 1.0641 1.0316
No log 5.84 146 1.1894 0.2851 1.1894 1.0906
No log 5.92 148 1.2159 0.3220 1.2159 1.1027
No log 6.0 150 1.2350 0.3363 1.2350 1.1113
No log 6.08 152 1.2218 0.2807 1.2218 1.1054
No log 6.16 154 1.1156 0.3100 1.1156 1.0562
No log 6.24 156 1.0519 0.3250 1.0519 1.0256
No log 6.32 158 1.0571 0.3319 1.0571 1.0282
No log 6.4 160 1.1789 0.2806 1.1789 1.0858
No log 6.48 162 1.4947 0.2624 1.4947 1.2226
No log 6.5600 164 1.5805 0.2190 1.5805 1.2572
No log 6.64 166 1.3044 0.2468 1.3044 1.1421
No log 6.72 168 1.0460 0.3956 1.0460 1.0228
No log 6.8 170 1.0101 0.3650 1.0101 1.0050
No log 6.88 172 1.0525 0.3773 1.0525 1.0259
No log 6.96 174 1.0959 0.3184 1.0959 1.0468
No log 7.04 176 1.1799 0.3547 1.1799 1.0862
No log 7.12 178 1.2684 0.3171 1.2684 1.1262
No log 7.2 180 1.2756 0.3171 1.2756 1.1294
No log 7.28 182 1.1637 0.3497 1.1637 1.0787
No log 7.36 184 1.1627 0.3255 1.1627 1.0783
No log 7.44 186 1.0311 0.3062 1.0311 1.0154
No log 7.52 188 1.0234 0.3328 1.0234 1.0116
No log 7.6 190 1.1154 0.3846 1.1154 1.0561
No log 7.68 192 1.1389 0.3572 1.1389 1.0672
No log 7.76 194 1.1452 0.3572 1.1452 1.0701
No log 7.84 196 1.1237 0.3805 1.1237 1.0601
No log 7.92 198 1.0510 0.3781 1.0510 1.0252
No log 8.0 200 1.0152 0.3985 1.0152 1.0076
No log 8.08 202 1.0130 0.4090 1.0130 1.0065
No log 8.16 204 1.1579 0.3370 1.1579 1.0760
No log 8.24 206 1.2453 0.3254 1.2453 1.1159
No log 8.32 208 1.1082 0.3624 1.1082 1.0527
No log 8.4 210 1.0183 0.3457 1.0183 1.0091
No log 8.48 212 1.0048 0.3394 1.0048 1.0024
No log 8.56 214 1.0563 0.3224 1.0563 1.0278
No log 8.64 216 1.1443 0.3389 1.1443 1.0697
No log 8.72 218 1.0464 0.3287 1.0464 1.0229
No log 8.8 220 0.9761 0.3871 0.9761 0.9880
No log 8.88 222 1.0665 0.4191 1.0665 1.0327
No log 8.96 224 1.2967 0.2863 1.2967 1.1387
No log 9.04 226 1.3910 0.2730 1.3910 1.1794
No log 9.12 228 1.2003 0.2941 1.2003 1.0956
No log 9.2 230 1.0314 0.3199 1.0314 1.0156
No log 9.28 232 1.0531 0.2864 1.0531 1.0262
No log 9.36 234 1.1067 0.3204 1.1067 1.0520
No log 9.44 236 1.0918 0.2913 1.0918 1.0449
No log 9.52 238 1.0527 0.2963 1.0527 1.0260
No log 9.6 240 1.0602 0.2632 1.0602 1.0296
No log 9.68 242 1.1596 0.2917 1.1596 1.0769
No log 9.76 244 1.2393 0.2438 1.2393 1.1132
No log 9.84 246 1.2023 0.2670 1.2023 1.0965
No log 9.92 248 1.0826 0.3062 1.0826 1.0405
No log 10.0 250 1.0374 0.3070 1.0374 1.0185
No log 10.08 252 1.0399 0.2782 1.0399 1.0198
No log 10.16 254 1.0803 0.2835 1.0803 1.0394
No log 10.24 256 1.1111 0.2795 1.1111 1.0541
No log 10.32 258 1.0677 0.3104 1.0677 1.0333
No log 10.4 260 1.0724 0.3397 1.0724 1.0356
No log 10.48 262 1.0456 0.2902 1.0456 1.0225
No log 10.56 264 1.0708 0.3415 1.0708 1.0348
No log 10.64 266 1.1257 0.3467 1.1257 1.0610
No log 10.72 268 1.1257 0.3231 1.1257 1.0610
No log 10.8 270 1.0903 0.2431 1.0903 1.0442
No log 10.88 272 1.0871 0.3311 1.0871 1.0427
No log 10.96 274 1.1153 0.1630 1.1153 1.0561
No log 11.04 276 1.1314 0.2410 1.1314 1.0637
No log 11.12 278 1.1292 0.2410 1.1292 1.0626
No log 11.2 280 1.1385 0.2939 1.1385 1.0670
No log 11.28 282 1.1766 0.3606 1.1766 1.0847
No log 11.36 284 1.3384 0.2339 1.3384 1.1569
No log 11.44 286 1.3517 0.2339 1.3517 1.1626
No log 11.52 288 1.2785 0.2852 1.2785 1.1307
No log 11.6 290 1.1548 0.3255 1.1548 1.0746
No log 11.68 292 1.1066 0.3319 1.1066 1.0520
No log 11.76 294 1.1091 0.2892 1.1091 1.0531
No log 11.84 296 1.1376 0.2891 1.1376 1.0666
No log 11.92 298 1.2231 0.3144 1.2231 1.1059
No log 12.0 300 1.4098 0.2833 1.4098 1.1874
No log 12.08 302 1.4463 0.3022 1.4463 1.2026
No log 12.16 304 1.2863 0.2807 1.2863 1.1342
No log 12.24 306 1.0595 0.2871 1.0595 1.0293
No log 12.32 308 0.9936 0.3619 0.9936 0.9968
No log 12.4 310 0.9667 0.3811 0.9667 0.9832
No log 12.48 312 0.9443 0.3607 0.9443 0.9718
No log 12.56 314 1.0020 0.3476 1.0020 1.0010
No log 12.64 316 1.1056 0.3355 1.1056 1.0515
No log 12.72 318 1.2237 0.2350 1.2237 1.1062
No log 12.8 320 1.2241 0.2350 1.2241 1.1064
No log 12.88 322 1.1377 0.3050 1.1377 1.0666
No log 12.96 324 0.9995 0.3728 0.9995 0.9997
No log 13.04 326 0.9537 0.3845 0.9537 0.9766
No log 13.12 328 0.9681 0.3674 0.9681 0.9839
No log 13.2 330 0.9759 0.3983 0.9759 0.9879
No log 13.28 332 1.0448 0.3224 1.0448 1.0222
No log 13.36 334 1.1840 0.2762 1.1840 1.0881
No log 13.44 336 1.2221 0.2181 1.2221 1.1055
No log 13.52 338 1.1641 0.2549 1.1641 1.0789
No log 13.6 340 1.0633 0.3224 1.0633 1.0312
No log 13.68 342 1.0134 0.2966 1.0134 1.0067
No log 13.76 344 0.9954 0.4042 0.9954 0.9977
No log 13.84 346 0.9959 0.4084 0.9959 0.9980
No log 13.92 348 1.0197 0.3250 1.0197 1.0098
No log 14.0 350 1.1784 0.2762 1.1784 1.0856
No log 14.08 352 1.3497 0.2962 1.3497 1.1618
No log 14.16 354 1.3374 0.2962 1.3374 1.1564
No log 14.24 356 1.1770 0.3307 1.1770 1.0849
No log 14.32 358 1.0345 0.2805 1.0345 1.0171
No log 14.4 360 1.0009 0.2989 1.0009 1.0004
No log 14.48 362 1.0125 0.2964 1.0125 1.0062
No log 14.56 364 1.0632 0.2796 1.0632 1.0311
No log 14.64 366 1.1490 0.2750 1.1490 1.0719
No log 14.72 368 1.1441 0.3078 1.1441 1.0696
No log 14.8 370 1.0944 0.2938 1.0944 1.0461
No log 14.88 372 1.0884 0.2417 1.0884 1.0433
No log 14.96 374 1.1480 0.3074 1.1480 1.0714
No log 15.04 376 1.2172 0.3050 1.2172 1.1033
No log 15.12 378 1.1741 0.2762 1.1741 1.0836
No log 15.2 380 1.1046 0.2085 1.1046 1.0510
No log 15.28 382 1.0563 0.2914 1.0563 1.0278
No log 15.36 384 1.0456 0.2815 1.0456 1.0226
No log 15.44 386 1.0793 0.3082 1.0793 1.0389
No log 15.52 388 1.1038 0.2986 1.1038 1.0506
No log 15.6 390 1.0812 0.2567 1.0812 1.0398
No log 15.68 392 1.1020 0.2567 1.1020 1.0498
No log 15.76 394 1.1680 0.2476 1.1680 1.0807
No log 15.84 396 1.2916 0.2746 1.2916 1.1365
No log 15.92 398 1.3351 0.2941 1.3351 1.1555
No log 16.0 400 1.2665 0.2673 1.2665 1.1254
No log 16.08 402 1.1447 0.2516 1.1447 1.0699
No log 16.16 404 1.0683 0.2986 1.0683 1.0336
No log 16.24 406 1.0189 0.3263 1.0189 1.0094
No log 16.32 408 1.0124 0.3660 1.0124 1.0062
No log 16.4 410 1.0335 0.3418 1.0335 1.0166
No log 16.48 412 1.0602 0.3149 1.0602 1.0296
No log 16.56 414 1.0277 0.3476 1.0277 1.0138
No log 16.64 416 1.0018 0.3476 1.0018 1.0009
No log 16.72 418 0.9813 0.3813 0.9813 0.9906
No log 16.8 420 0.9986 0.3572 0.9986 0.9993
No log 16.88 422 1.0557 0.3439 1.0557 1.0275
No log 16.96 424 1.1517 0.2941 1.1517 1.0732
No log 17.04 426 1.1436 0.3238 1.1436 1.0694
No log 17.12 428 1.0712 0.3347 1.0712 1.0350
No log 17.2 430 1.0100 0.3104 1.0100 1.0050
No log 17.28 432 1.0015 0.3365 1.0015 1.0008
No log 17.36 434 1.0275 0.3263 1.0275 1.0136
No log 17.44 436 1.0817 0.2605 1.0817 1.0401
No log 17.52 438 1.1811 0.2673 1.1811 1.0868
No log 17.6 440 1.2042 0.2585 1.2042 1.0973
No log 17.68 442 1.1200 0.2940 1.1200 1.0583
No log 17.76 444 1.0249 0.3555 1.0249 1.0124
No log 17.84 446 1.0109 0.3327 1.0109 1.0054
No log 17.92 448 0.9979 0.3037 0.9979 0.9990
No log 18.0 450 1.0004 0.3421 1.0004 1.0002
No log 18.08 452 1.0500 0.2986 1.0500 1.0247
No log 18.16 454 1.1483 0.3377 1.1483 1.0716
No log 18.24 456 1.1409 0.3377 1.1409 1.0681
No log 18.32 458 1.1081 0.2984 1.1081 1.0526
No log 18.4 460 1.0625 0.3121 1.0625 1.0308
No log 18.48 462 1.0175 0.3294 1.0175 1.0087
No log 18.56 464 1.0140 0.3493 1.0140 1.0070
No log 18.64 466 1.0222 0.3493 1.0222 1.0111
No log 18.72 468 1.0308 0.3415 1.0308 1.0153
No log 18.8 470 1.0616 0.3294 1.0616 1.0304
No log 18.88 472 1.1474 0.2492 1.1474 1.0712
No log 18.96 474 1.3246 0.3067 1.3246 1.1509
No log 19.04 476 1.4281 0.2481 1.4281 1.1950
No log 19.12 478 1.4051 0.2481 1.4051 1.1854
No log 19.2 480 1.2973 0.3067 1.2973 1.1390
No log 19.28 482 1.1683 0.2702 1.1683 1.0809
No log 19.36 484 1.0940 0.3427 1.0940 1.0460
No log 19.44 486 1.0698 0.2694 1.0698 1.0343
No log 19.52 488 1.0623 0.3090 1.0623 1.0307
No log 19.6 490 1.0601 0.3039 1.0601 1.0296
No log 19.68 492 1.0891 0.2461 1.0891 1.0436
No log 19.76 494 1.1770 0.3323 1.1770 1.0849
No log 19.84 496 1.2395 0.3238 1.2395 1.1133
No log 19.92 498 1.2661 0.3238 1.2661 1.1252
0.3289 20.0 500 1.2093 0.3323 1.2093 1.0997
0.3289 20.08 502 1.1367 0.2534 1.1367 1.0662
0.3289 20.16 504 1.1078 0.2505 1.1078 1.0525
0.3289 20.24 506 1.1016 0.2113 1.1016 1.0495
0.3289 20.32 508 1.1066 0.1458 1.1066 1.0519
0.3289 20.4 510 1.1188 0.1458 1.1188 1.0577

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run3_AugV5_k7_task2_organization

Finetuned
(4019)
this model