ArabicNewSplits7_FineTuningAraBERT_run2_AugV5_k6_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset (the card records the dataset as None). It achieves the following results on the evaluation set:

  • Loss: 0.8616
  • Qwk: 0.3365
  • Mse: 0.8616
  • Rmse: 0.9282
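
The four figures above are related: the loss equals the MSE, and the RMSE is the square root of the MSE. The short check below illustrates this using only the values listed in this card. That the training objective was mean squared error is an inference from the matching numbers, not something the card states.

```python
import math

# Values copied from the evaluation summary in this card.
loss = 0.8616
mse = 0.8616
rmse = 0.9282

# The reported loss matches the MSE. This suggests an MSE objective,
# which is an assumption rather than a documented fact.
loss_matches_mse = math.isclose(loss, mse, abs_tol=1e-4)

# RMSE should equal sqrt(MSE) up to four-decimal rounding.
rmse_from_mse = math.sqrt(mse)
rmse_consistent = math.isclose(rmse_from_mse, rmse, abs_tol=1e-4)

print(f"loss == mse: {loss_matches_mse}")
print(f"sqrt(mse) = {rmse_from_mse:.4f}, consistent with reported rmse: {rmse_consistent}")
```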

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
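
As a rough illustration of what a linear scheduler does with these settings, the sketch below computes the learning rate at a given optimizer step. It rests on several assumptions not stated in the card: zero warmup steps, 17 optimizer steps per epoch (inferred from the training log, where step 34 corresponds to epoch 2.0), and a decay horizon of all 100 configured epochs. The function name `linear_lr` is hypothetical and is not part of any library.

```python
BASE_LR = 2e-05        # learning_rate from the card
NUM_EPOCHS = 100       # num_epochs from the card
STEPS_PER_EPOCH = 17   # assumption: inferred from the log (step 34 at epoch 2.0)
TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step, base_lr=BASE_LR, total_steps=TOTAL_STEPS, warmup_steps=0):
    """Linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the card does not list a warmup setting.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, at the last logged step (510), and at the end.
for step in (0, 510, TOTAL_STEPS):
    print(step, linear_lr(step))
```

Under these assumptions the learning rate at the last logged step would still be about 70% of its initial value, because the decay is spread over the full 100 configured epochs.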

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1176 2 2.7268 -0.0262 2.7268 1.6513
No log 0.2353 4 1.4160 0.0771 1.4160 1.1900
No log 0.3529 6 1.0447 -0.1304 1.0447 1.0221
No log 0.4706 8 1.2835 -0.2437 1.2835 1.1329
No log 0.5882 10 1.5149 -0.2448 1.5149 1.2308
No log 0.7059 12 1.6208 -0.1307 1.6208 1.2731
No log 0.8235 14 1.3323 -0.1803 1.3323 1.1542
No log 0.9412 16 1.0674 0.0327 1.0674 1.0331
No log 1.0588 18 0.9372 0.2227 0.9372 0.9681
No log 1.1765 20 0.9146 0.2504 0.9146 0.9564
No log 1.2941 22 0.8599 0.3425 0.8599 0.9273
No log 1.4118 24 0.8638 0.1407 0.8638 0.9294
No log 1.5294 26 0.9672 0.0 0.9672 0.9834
No log 1.6471 28 1.0828 -0.0660 1.0828 1.0406
No log 1.7647 30 1.0018 -0.0426 1.0018 1.0009
No log 1.8824 32 0.8917 0.0 0.8917 0.9443
No log 2.0 34 0.8040 0.0327 0.8040 0.8967
No log 2.1176 36 0.8083 0.2407 0.8083 0.8990
No log 2.2353 38 0.8258 0.2463 0.8258 0.9087
No log 2.3529 40 0.8547 0.3294 0.8547 0.9245
No log 2.4706 42 0.8759 0.3099 0.8759 0.9359
No log 2.5882 44 0.8845 0.0840 0.8845 0.9405
No log 2.7059 46 0.8996 0.1327 0.8996 0.9485
No log 2.8235 48 0.8627 0.1754 0.8627 0.9288
No log 2.9412 50 0.8310 0.1372 0.8310 0.9116
No log 3.0588 52 0.7444 0.0428 0.7444 0.8628
No log 3.1765 54 0.7285 0.1508 0.7285 0.8535
No log 3.2941 56 0.7495 0.1863 0.7495 0.8657
No log 3.4118 58 0.7724 0.1407 0.7724 0.8788
No log 3.5294 60 0.8269 -0.0511 0.8269 0.9093
No log 3.6471 62 0.9254 -0.0392 0.9254 0.9620
No log 3.7647 64 1.1030 -0.0264 1.1030 1.0502
No log 3.8824 66 1.2248 0.0081 1.2248 1.1067
No log 4.0 68 1.0681 -0.0112 1.0681 1.0335
No log 4.1176 70 1.0519 0.2173 1.0519 1.0256
No log 4.2353 72 1.0235 0.2076 1.0235 1.0117
No log 4.3529 74 1.0578 0.0120 1.0578 1.0285
No log 4.4706 76 1.0491 -0.0497 1.0491 1.0243
No log 4.5882 78 1.0570 0.1101 1.0570 1.0281
No log 4.7059 80 1.0229 0.1289 1.0229 1.0114
No log 4.8235 82 1.0278 0.1827 1.0278 1.0138
No log 4.9412 84 1.0840 0.1709 1.0840 1.0411
No log 5.0588 86 1.0419 0.2537 1.0419 1.0207
No log 5.1765 88 1.0101 0.0244 1.0101 1.0050
No log 5.2941 90 0.9974 -0.0108 0.9974 0.9987
No log 5.4118 92 1.0215 0.2124 1.0215 1.0107
No log 5.5294 94 1.2082 0.0948 1.2082 1.0992
No log 5.6471 96 1.2811 0.0852 1.2811 1.1319
No log 5.7647 98 1.1839 0.0881 1.1839 1.0881
No log 5.8824 100 1.0396 0.2728 1.0396 1.0196
No log 6.0 102 0.8852 0.2149 0.8852 0.9408
No log 6.1176 104 0.8421 0.1353 0.8421 0.9177
No log 6.2353 106 0.8525 0.2270 0.8525 0.9233
No log 6.3529 108 0.8804 0.3817 0.8804 0.9383
No log 6.4706 110 0.9760 0.3398 0.9760 0.9879
No log 6.5882 112 1.0358 0.2708 1.0358 1.0178
No log 6.7059 114 0.9867 0.28 0.9867 0.9933
No log 6.8235 116 0.9717 0.1843 0.9717 0.9858
No log 6.9412 118 1.0158 0.3043 1.0158 1.0079
No log 7.0588 120 1.2548 0.1409 1.2548 1.1202
No log 7.1765 122 1.3189 0.1357 1.3189 1.1484
No log 7.2941 124 1.1720 0.2231 1.1720 1.0826
No log 7.4118 126 0.9737 0.2076 0.9737 0.9867
No log 7.5294 128 0.9135 0.1373 0.9135 0.9558
No log 7.6471 130 0.9125 0.1867 0.9125 0.9552
No log 7.7647 132 0.9565 0.3110 0.9565 0.9780
No log 7.8824 134 0.8951 0.3069 0.8951 0.9461
No log 8.0 136 0.8671 0.2547 0.8671 0.9312
No log 8.1176 138 0.8372 0.2661 0.8372 0.9150
No log 8.2353 140 0.8692 0.2670 0.8692 0.9323
No log 8.3529 142 0.9025 0.4085 0.9025 0.9500
No log 8.4706 144 0.8102 0.4464 0.8102 0.9001
No log 8.5882 146 0.8685 0.3760 0.8685 0.9320
No log 8.7059 148 1.0533 0.2613 1.0533 1.0263
No log 8.8235 150 1.2251 0.2299 1.2251 1.1068
No log 8.9412 152 1.0062 0.3431 1.0062 1.0031
No log 9.0588 154 0.8419 0.3011 0.8419 0.9175
No log 9.1765 156 0.9172 0.4321 0.9172 0.9577
No log 9.2941 158 1.2512 0.1841 1.2512 1.1186
No log 9.4118 160 1.5511 0.1759 1.5511 1.2454
No log 9.5294 162 1.4350 0.1965 1.4350 1.1979
No log 9.6471 164 1.2424 0.1776 1.2424 1.1146
No log 9.7647 166 0.9863 0.3128 0.9863 0.9931
No log 9.8824 168 0.8805 0.3255 0.8805 0.9383
No log 10.0 170 0.8680 0.2402 0.8680 0.9317
No log 10.1176 172 0.8844 0.2993 0.8844 0.9404
No log 10.2353 174 0.9124 0.2937 0.9124 0.9552
No log 10.3529 176 0.9547 0.3667 0.9547 0.9771
No log 10.4706 178 0.9907 0.3582 0.9907 0.9953
No log 10.5882 180 0.9756 0.3425 0.9756 0.9877
No log 10.7059 182 0.9091 0.3125 0.9091 0.9535
No log 10.8235 184 0.8348 0.3609 0.8348 0.9137
No log 10.9412 186 0.8152 0.2172 0.8152 0.9029
No log 11.0588 188 0.8597 0.2988 0.8597 0.9272
No log 11.1765 190 0.9727 0.3483 0.9727 0.9863
No log 11.2941 192 1.0128 0.2905 1.0128 1.0064
No log 11.4118 194 0.9022 0.3645 0.9022 0.9499
No log 11.5294 196 0.8520 0.3520 0.8520 0.9230
No log 11.6471 198 0.8405 0.3520 0.8405 0.9168
No log 11.7647 200 0.8741 0.3520 0.8741 0.9350
No log 11.8824 202 0.8856 0.3520 0.8856 0.9411
No log 12.0 204 0.9468 0.3456 0.9468 0.9730
No log 12.1176 206 0.9945 0.3333 0.9945 0.9972
No log 12.2353 208 1.1129 0.2754 1.1129 1.0549
No log 12.3529 210 1.0393 0.2577 1.0393 1.0195
No log 12.4706 212 0.9105 0.3100 0.9105 0.9542
No log 12.5882 214 0.8820 0.3100 0.8820 0.9391
No log 12.7059 216 0.9812 0.2602 0.9812 0.9906
No log 12.8235 218 1.1435 0.2622 1.1435 1.0693
No log 12.9412 220 1.0573 0.2524 1.0573 1.0282
No log 13.0588 222 0.8061 0.3976 0.8061 0.8978
No log 13.1765 224 0.7873 0.3316 0.7873 0.8873
No log 13.2941 226 0.8001 0.3239 0.8001 0.8945
No log 13.4118 228 0.7290 0.3079 0.7290 0.8538
No log 13.5294 230 0.8035 0.3699 0.8035 0.8964
No log 13.6471 232 0.9809 0.2547 0.9809 0.9904
No log 13.7647 234 0.9399 0.2894 0.9399 0.9695
No log 13.8824 236 0.8184 0.3565 0.8184 0.9046
No log 14.0 238 0.8240 0.2479 0.8240 0.9077
No log 14.1176 240 0.8602 0.2747 0.8602 0.9275
No log 14.2353 242 0.9256 0.3355 0.9256 0.9621
No log 14.3529 244 1.0371 0.2636 1.0371 1.0184
No log 14.4706 246 1.1451 0.2312 1.1451 1.0701
No log 14.5882 248 1.0728 0.2312 1.0728 1.0357
No log 14.7059 250 0.9683 0.2677 0.9683 0.9840
No log 14.8235 252 0.9170 0.1884 0.9170 0.9576
No log 14.9412 254 0.8785 0.2616 0.8785 0.9373
No log 15.0588 256 0.8836 0.2887 0.8836 0.9400
No log 15.1765 258 0.9271 0.2564 0.9271 0.9628
No log 15.2941 260 0.9536 0.2627 0.9536 0.9765
No log 15.4118 262 1.0943 0.2354 1.0943 1.0461
No log 15.5294 264 0.9701 0.2531 0.9701 0.9850
No log 15.6471 266 0.8374 0.2975 0.8374 0.9151
No log 15.7647 268 0.8457 0.2832 0.8457 0.9196
No log 15.8824 270 0.9491 0.3022 0.9491 0.9742
No log 16.0 272 1.0803 0.2271 1.0803 1.0394
No log 16.1176 274 1.1770 0.2687 1.1770 1.0849
No log 16.2353 276 1.0714 0.2777 1.0714 1.0351
No log 16.3529 278 0.8794 0.2853 0.8794 0.9377
No log 16.4706 280 0.7700 0.4134 0.7700 0.8775
No log 16.5882 282 0.7647 0.3867 0.7647 0.8745
No log 16.7059 284 0.8027 0.3194 0.8027 0.8959
No log 16.8235 286 0.9047 0.2964 0.9047 0.9511
No log 16.9412 288 0.9419 0.2964 0.9419 0.9705
No log 17.0588 290 0.8386 0.3371 0.8386 0.9158
No log 17.1765 292 0.7069 0.3700 0.7069 0.8408
No log 17.2941 294 0.6875 0.3253 0.6875 0.8292
No log 17.4118 296 0.6975 0.3525 0.6975 0.8351
No log 17.5294 298 0.7814 0.4295 0.7814 0.8840
No log 17.6471 300 1.0722 0.3080 1.0722 1.0355
No log 17.7647 302 1.3476 0.2633 1.3476 1.1609
No log 17.8824 304 1.3075 0.3435 1.3075 1.1435
No log 18.0 306 1.0442 0.3131 1.0442 1.0218
No log 18.1176 308 0.8087 0.3486 0.8087 0.8993
No log 18.2353 310 0.7603 0.2203 0.7603 0.8719
No log 18.3529 312 0.7258 0.2838 0.7258 0.8519
No log 18.4706 314 0.7210 0.3936 0.7210 0.8491
No log 18.5882 316 0.7800 0.4038 0.7800 0.8832
No log 18.7059 318 0.8186 0.3461 0.8186 0.9048
No log 18.8235 320 0.7982 0.3803 0.7982 0.8934
No log 18.9412 322 0.7618 0.3868 0.7618 0.8728
No log 19.0588 324 0.7447 0.4104 0.7447 0.8630
No log 19.1765 326 0.7229 0.3567 0.7229 0.8503
No log 19.2941 328 0.7531 0.3667 0.7531 0.8678
No log 19.4118 330 0.8393 0.3699 0.8393 0.9161
No log 19.5294 332 0.9032 0.2881 0.9032 0.9504
No log 19.6471 334 0.8609 0.3217 0.8609 0.9278
No log 19.7647 336 0.8481 0.2702 0.8481 0.9209
No log 19.8824 338 0.8310 0.2958 0.8310 0.9116
No log 20.0 340 0.8113 0.2442 0.8113 0.9007
No log 20.1176 342 0.8384 0.2958 0.8384 0.9156
No log 20.2353 344 0.8773 0.2627 0.8773 0.9367
No log 20.3529 346 0.8885 0.2578 0.8885 0.9426
No log 20.4706 348 0.8163 0.2832 0.8163 0.9035
No log 20.5882 350 0.7591 0.3026 0.7591 0.8713
No log 20.7059 352 0.7077 0.3914 0.7077 0.8412
No log 20.8235 354 0.7117 0.4044 0.7117 0.8436
No log 20.9412 356 0.7709 0.4038 0.7709 0.8780
No log 21.0588 358 0.7742 0.3803 0.7742 0.8799
No log 21.1765 360 0.7234 0.4334 0.7234 0.8505
No log 21.2941 362 0.6835 0.4625 0.6835 0.8267
No log 21.4118 364 0.6635 0.4260 0.6635 0.8145
No log 21.5294 366 0.6582 0.4448 0.6582 0.8113
No log 21.6471 368 0.7036 0.4085 0.7036 0.8388
No log 21.7647 370 0.7369 0.4017 0.7369 0.8584
No log 21.8824 372 0.8565 0.4133 0.8565 0.9255
No log 22.0 374 0.9556 0.3645 0.9556 0.9775
No log 22.1176 376 0.8661 0.3803 0.8661 0.9306
No log 22.2353 378 0.7347 0.2835 0.7347 0.8572
No log 22.3529 380 0.6900 0.3078 0.6900 0.8306
No log 22.4706 382 0.6824 0.3106 0.6824 0.8261
No log 22.5882 384 0.6926 0.3197 0.6926 0.8322
No log 22.7059 386 0.7285 0.3656 0.7285 0.8535
No log 22.8235 388 0.8695 0.4177 0.8695 0.9325
No log 22.9412 390 1.0599 0.4007 1.0599 1.0295
No log 23.0588 392 1.0394 0.4140 1.0394 1.0195
No log 23.1765 394 0.8632 0.3451 0.8632 0.9291
No log 23.2941 396 0.7474 0.4263 0.7474 0.8645
No log 23.4118 398 0.7047 0.3665 0.7047 0.8395
No log 23.5294 400 0.7068 0.3106 0.7068 0.8407
No log 23.6471 402 0.7263 0.3015 0.7263 0.8523
No log 23.7647 404 0.7828 0.3329 0.7828 0.8847
No log 23.8824 406 0.9052 0.3151 0.9052 0.9514
No log 24.0 408 1.0943 0.2591 1.0943 1.0461
No log 24.1176 410 1.3340 0.3217 1.3340 1.1550
No log 24.2353 412 1.2969 0.3540 1.2969 1.1388
No log 24.3529 414 1.0768 0.3367 1.0768 1.0377
No log 24.4706 416 0.8345 0.3297 0.8345 0.9135
No log 24.5882 418 0.7071 0.3393 0.7071 0.8409
No log 24.7059 420 0.6914 0.3813 0.6914 0.8315
No log 24.8235 422 0.7184 0.3393 0.7184 0.8476
No log 24.9412 424 0.8414 0.3483 0.8414 0.9173
No log 25.0588 426 0.9714 0.2993 0.9714 0.9856
No log 25.1765 428 0.9608 0.2830 0.9608 0.9802
No log 25.2941 430 0.9432 0.2830 0.9432 0.9712
No log 25.4118 432 0.8561 0.3740 0.8561 0.9253
No log 25.5294 434 0.7719 0.3973 0.7719 0.8786
No log 25.6471 436 0.7288 0.4404 0.7288 0.8537
No log 25.7647 438 0.7430 0.4265 0.7430 0.8620
No log 25.8824 440 0.7966 0.4199 0.7966 0.8925
No log 26.0 442 0.8717 0.3542 0.8717 0.9336
No log 26.1176 444 0.9179 0.3847 0.9179 0.9581
No log 26.2353 446 0.8905 0.3906 0.8905 0.9437
No log 26.3529 448 0.7756 0.4385 0.7756 0.8807
No log 26.4706 450 0.7199 0.3866 0.7199 0.8485
No log 26.5882 452 0.7108 0.3866 0.7108 0.8431
No log 26.7059 454 0.7329 0.3866 0.7329 0.8561
No log 26.8235 456 0.8358 0.3868 0.8358 0.9142
No log 26.9412 458 1.0947 0.3174 1.0947 1.0463
No log 27.0588 460 1.2996 0.3451 1.2996 1.1400
No log 27.1765 462 1.3226 0.3097 1.3226 1.1500
No log 27.2941 464 1.1981 0.3240 1.1981 1.0946
No log 27.4118 466 0.9736 0.3290 0.9736 0.9867
No log 27.5294 468 0.7811 0.3344 0.7811 0.8838
No log 27.6471 470 0.7285 0.2684 0.7285 0.8535
No log 27.7647 472 0.7255 0.2058 0.7255 0.8518
No log 27.8824 474 0.7420 0.3340 0.7420 0.8614
No log 28.0 476 0.7995 0.4093 0.7995 0.8941
No log 28.1176 478 0.8740 0.3043 0.8740 0.9349
No log 28.2353 480 0.9132 0.3560 0.9132 0.9556
No log 28.3529 482 0.9617 0.3074 0.9617 0.9807
No log 28.4706 484 0.9800 0.3152 0.9800 0.9900
No log 28.5882 486 0.9650 0.3456 0.9650 0.9823
No log 28.7059 488 0.9087 0.3486 0.9087 0.9532
No log 28.8235 490 0.8491 0.3699 0.8491 0.9215
No log 28.9412 492 0.8215 0.3699 0.8215 0.9064
No log 29.0588 494 0.8292 0.3638 0.8292 0.9106
No log 29.1765 496 0.8664 0.3160 0.8664 0.9308
No log 29.2941 498 0.8573 0.2910 0.8573 0.9259
0.3331 29.4118 500 0.8194 0.3645 0.8194 0.9052
0.3331 29.5294 502 0.8208 0.3243 0.8208 0.9060
0.3331 29.6471 504 0.8122 0.3402 0.8122 0.9012
0.3331 29.7647 506 0.8064 0.3402 0.8064 0.8980
0.3331 29.8824 508 0.8150 0.3365 0.8150 0.9028
0.3331 30.0 510 0.8616 0.3365 0.8616 0.9282
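
The Qwk column is presumably quadratic weighted kappa, an agreement measure for ordinal labels; the card does not define the abbreviation, so treat that reading as an assumption. The sketch below is a minimal pure-Python version of the metric for integer labels in the range 0 to num_classes - 1. It is not the evaluation code used for this model, and it assumes any continuous model outputs have already been rounded to integer scores.

```python
def quadratic_weighted_kappa(y_true, y_pred, num_classes):
    """Quadratic weighted kappa for integer labels in [0, num_classes - 1]."""
    n = num_classes
    # Observed confusion matrix: rows are true labels, columns are predictions.
    observed = [[0] * n for _ in range(n)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1

    total = len(y_true)
    hist_true = [sum(row) for row in observed]
    hist_pred = [sum(observed[i][j] for i in range(n)) for j in range(n)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(n):
        for j in range(n):
            # Quadratic penalty grows with the squared distance between labels.
            weight = (i - j) ** 2 / (n - 1) ** 2
            expected = hist_true[i] * hist_pred[j] / total
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * expected

    if weighted_expected == 0:
        # Only happens when every true label and every prediction is the
        # same single class, i.e. trivial perfect agreement.
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


print(quadratic_weighted_kappa([0, 1, 2, 2], [0, 1, 1, 2], num_classes=3))
print(quadratic_weighted_kappa([0, 1, 2], [1, 1, 1], num_classes=3))
```

With this definition, a model that predicts the same class for every example scores 0, perfect agreement scores 1, and negative values indicate agreement worse than chance.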

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size

  • Parameters: 0.1B
  • Tensor type: F32
  • Format: Safetensors

Model tree for MayBashendy/ArabicNewSplits7_FineTuningAraBERT_run2_AugV5_k6_task7_organization

  • Base model: aubmindlab/bert-base-arabertv02 (this model is a fine-tuned version)