ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run1_AugV5_k5_task2_organization

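This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified in this card. On the evaluation set, the model achieves the following results (these match the last logged row of the training results, epoch 30.0, step 540):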

  • Loss: 1.0508
  • Qwk: 0.3956
  • Mse: 1.0508
  • Rmse: 1.0251
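
The card does not say how these metrics were computed. As a rough sketch, the block below shows one common way to compute quadratic weighted kappa (Qwk), Mse, and Rmse in plain Python. The function names, the integer rating range, and the rounding of continuous predictions before computing Qwk are all assumptions made for illustration, not the author's evaluation code. Loss and Mse are identical in every logged row, which suggests (but does not confirm) that the model was trained as a regressor with a mean-squared-error loss.

```python
import math


def quadratic_weighted_kappa(y_true, y_pred, min_rating, max_rating):
    """Quadratic weighted kappa for integer ratings (hypothetical helper)."""
    n = max_rating - min_rating + 1
    # Confusion matrix: rows are true ratings, columns are predicted ratings.
    observed = [[0] * n for _ in range(n)]
    for t, p in zip(y_true, y_pred):
        observed[t - min_rating][p - min_rating] += 1
    total = len(y_true)
    hist_true = [sum(row) for row in observed]
    hist_pred = [sum(observed[i][j] for i in range(n)) for j in range(n)]
    num = 0.0
    den = 0.0
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2  # quadratic disagreement penalty
            expected = hist_true[i] * hist_pred[j] / total
            num += weight * observed[i][j]
            den += weight * expected
    if den == 0:
        # Degenerate case: every true and predicted rating is the same class.
        return 1.0
    return 1.0 - num / den


def mse(y_true, y_pred):
    """Mean squared error between true scores and raw predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, i.e. the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


# Made-up example data, not taken from the evaluation set.
true_scores = [0, 1, 2, 3]
raw_predictions = [0.2, 1.4, 1.9, 3.3]
rounded = [int(round(p)) for p in raw_predictions]
kappa = quadratic_weighted_kappa(true_scores, rounded, 0, 3)
error = rmse(true_scores, raw_predictions)
```

The reported numbers are consistent with this relationship: the square root of the Mse value 1.0508 rounds to the Rmse value 1.0251.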

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
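
To make the listed settings concrete, the sketch below collects them in a dictionary whose keys follow the keyword names of `transformers.TrainingArguments`, and computes the learning rate that a linear schedule would produce at a given step. Several things here are assumptions rather than facts from the card: zero warmup steps, 18 optimizer steps per epoch (read off the results table, where step 18 is logged at epoch 1.0), and a schedule spanning all 100 configured epochs. Note that the results table stops at epoch 30; the card does not say why.

```python
# Hyperparameters from the card, keyed by transformers.TrainingArguments names.
HYPERPARAMETERS = {
    "learning_rate": 2e-05,
    "per_device_train_batch_size": 8,
    "per_device_eval_batch_size": 8,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 100,
}

# Assumption: 18 steps per epoch, inferred from the results table.
STEPS_PER_EPOCH = 18
TOTAL_STEPS = STEPS_PER_EPOCH * HYPERPARAMETERS["num_train_epochs"]


def linear_lr(step, base_lr, total_steps, warmup_steps=0):
    """Learning rate under linear warmup followed by linear decay to zero.

    warmup_steps defaults to 0 because the card lists no warmup setting
    (an assumption, not a confirmed value).
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0.0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the last logged step (540) under these assumptions.
lr_at_last_logged_step = linear_lr(
    540, HYPERPARAMETERS["learning_rate"], TOTAL_STEPS
)
```

Under these assumptions the learning rate at step 540 would be about 70% of its initial value, since 540 of the 1800 scheduled steps have elapsed.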

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1111 2 4.7995 -0.0104 4.7995 2.1908
No log 0.2222 4 2.6013 0.0426 2.6013 1.6128
No log 0.3333 6 1.7835 -0.0017 1.7835 1.3355
No log 0.4444 8 2.1171 0.0023 2.1171 1.4550
No log 0.5556 10 1.8754 0.0527 1.8754 1.3695
No log 0.6667 12 1.5438 0.1309 1.5438 1.2425
No log 0.7778 14 1.3481 0.0900 1.3481 1.1611
No log 0.8889 16 1.2906 0.1562 1.2906 1.1360
No log 1.0 18 1.2474 0.1990 1.2474 1.1169
No log 1.1111 20 1.2987 0.1725 1.2987 1.1396
No log 1.2222 22 1.6623 0.1972 1.6623 1.2893
No log 1.3333 24 1.9060 0.2153 1.9060 1.3806
No log 1.4444 26 1.8340 0.2050 1.8340 1.3542
No log 1.5556 28 2.0264 0.0978 2.0264 1.4235
No log 1.6667 30 1.9255 0.1309 1.9255 1.3876
No log 1.7778 32 1.6292 0.2530 1.6292 1.2764
No log 1.8889 34 1.4174 0.2781 1.4174 1.1905
No log 2.0 36 1.3262 0.2010 1.3262 1.1516
No log 2.1111 38 1.5493 0.1602 1.5493 1.2447
No log 2.2222 40 2.0593 0.0917 2.0593 1.4350
No log 2.3333 42 1.9298 0.1331 1.9298 1.3892
No log 2.4444 44 1.4915 0.1418 1.4915 1.2213
No log 2.5556 46 1.3495 0.1891 1.3495 1.1617
No log 2.6667 48 1.2510 0.1838 1.2510 1.1185
No log 2.7778 50 1.2099 0.1136 1.2099 1.1000
No log 2.8889 52 1.3000 0.1404 1.3000 1.1402
No log 3.0 54 1.6797 0.1301 1.6797 1.2960
No log 3.1111 56 2.0956 0.0917 2.0956 1.4476
No log 3.2222 58 2.2876 0.1172 2.2876 1.5125
No log 3.3333 60 2.2605 0.1452 2.2605 1.5035
No log 3.4444 62 1.7694 0.2774 1.7694 1.3302
No log 3.5556 64 1.2268 0.1725 1.2268 1.1076
No log 3.6667 66 1.0361 0.2662 1.0361 1.0179
No log 3.7778 68 1.0427 0.2916 1.0427 1.0211
No log 3.8889 70 1.0395 0.3070 1.0395 1.0196
No log 4.0 72 1.0259 0.2662 1.0259 1.0129
No log 4.1111 74 1.1436 0.1951 1.1436 1.0694
No log 4.2222 76 1.2420 0.1743 1.2420 1.1144
No log 4.3333 78 1.2537 0.1601 1.2537 1.1197
No log 4.4444 80 1.3807 0.2201 1.3807 1.1750
No log 4.5556 82 1.3775 0.2607 1.3775 1.1737
No log 4.6667 84 1.1711 0.2963 1.1711 1.0822
No log 4.7778 86 1.0613 0.3237 1.0613 1.0302
No log 4.8889 88 1.0948 0.2904 1.0948 1.0463
No log 5.0 90 1.0547 0.3111 1.0547 1.0270
No log 5.1111 92 1.1716 0.2814 1.1716 1.0824
No log 5.2222 94 1.4043 0.3276 1.4043 1.1850
No log 5.3333 96 1.8200 0.2832 1.8200 1.3491
No log 5.4444 98 1.8116 0.2832 1.8116 1.3460
No log 5.5556 100 1.3766 0.3229 1.3766 1.1733
No log 5.6667 102 1.1058 0.3534 1.1058 1.0516
No log 5.7778 104 0.9696 0.4454 0.9696 0.9847
No log 5.8889 106 0.9621 0.4803 0.9621 0.9808
No log 6.0 108 0.9579 0.4139 0.9579 0.9787
No log 6.1111 110 0.9927 0.4176 0.9927 0.9963
No log 6.2222 112 1.0614 0.4082 1.0614 1.0303
No log 6.3333 114 1.1482 0.2903 1.1482 1.0715
No log 6.4444 116 1.3326 0.3318 1.3326 1.1544
No log 6.5556 118 1.4352 0.2314 1.4352 1.1980
No log 6.6667 120 1.3882 0.2677 1.3882 1.1782
No log 6.7778 122 1.4130 0.1615 1.4130 1.1887
No log 6.8889 124 1.3149 0.2065 1.3149 1.1467
No log 7.0 126 1.1683 0.2825 1.1683 1.0809
No log 7.1111 128 1.1365 0.3052 1.1365 1.0661
No log 7.2222 130 1.0345 0.3862 1.0345 1.0171
No log 7.3333 132 1.0064 0.3573 1.0064 1.0032
No log 7.4444 134 0.9786 0.4091 0.9786 0.9893
No log 7.5556 136 0.9683 0.4005 0.9683 0.9840
No log 7.6667 138 0.9438 0.4798 0.9438 0.9715
No log 7.7778 140 0.9240 0.4652 0.9240 0.9613
No log 7.8889 142 0.9834 0.3678 0.9834 0.9917
No log 8.0 144 1.2589 0.4478 1.2589 1.1220
No log 8.1111 146 1.3317 0.4028 1.3317 1.1540
No log 8.2222 148 1.0633 0.4015 1.0633 1.0312
No log 8.3333 150 1.1091 0.3955 1.1091 1.0531
No log 8.4444 152 1.2291 0.4440 1.2291 1.1086
No log 8.5556 154 1.1035 0.3823 1.1035 1.0505
No log 8.6667 156 0.9729 0.3300 0.9729 0.9863
No log 8.7778 158 1.0264 0.3321 1.0264 1.0131
No log 8.8889 160 1.1898 0.3323 1.1898 1.0908
No log 9.0 162 1.3572 0.3752 1.3572 1.1650
No log 9.1111 164 1.3905 0.3902 1.3905 1.1792
No log 9.2222 166 1.1808 0.4186 1.1808 1.0867
No log 9.3333 168 0.9648 0.3967 0.9648 0.9822
No log 9.4444 170 0.8239 0.5072 0.8239 0.9077
No log 9.5556 172 0.8002 0.5102 0.8002 0.8945
No log 9.6667 174 0.8471 0.4707 0.8471 0.9204
No log 9.7778 176 0.8146 0.5315 0.8146 0.9025
No log 9.8889 178 0.8111 0.5120 0.8111 0.9006
No log 10.0 180 0.9130 0.4843 0.9130 0.9555
No log 10.1111 182 0.9537 0.4765 0.9537 0.9766
No log 10.2222 184 0.8588 0.4581 0.8588 0.9267
No log 10.3333 186 0.8338 0.5658 0.8338 0.9132
No log 10.4444 188 0.8749 0.4703 0.8749 0.9353
No log 10.5556 190 0.9290 0.4843 0.9290 0.9638
No log 10.6667 192 0.9278 0.4873 0.9278 0.9632
No log 10.7778 194 0.7988 0.5659 0.7988 0.8938
No log 10.8889 196 0.7688 0.5216 0.7688 0.8768
No log 11.0 198 0.7643 0.5777 0.7643 0.8742
No log 11.1111 200 0.7803 0.5592 0.7803 0.8834
No log 11.2222 202 0.7676 0.5659 0.7676 0.8761
No log 11.3333 204 0.7557 0.5659 0.7557 0.8693
No log 11.4444 206 0.7656 0.5777 0.7656 0.8750
No log 11.5556 208 0.7709 0.5659 0.7709 0.8780
No log 11.6667 210 0.7970 0.5658 0.7970 0.8927
No log 11.7778 212 0.8303 0.5607 0.8303 0.9112
No log 11.8889 214 0.8526 0.5607 0.8526 0.9234
No log 12.0 216 0.8665 0.5172 0.8665 0.9309
No log 12.1111 218 0.9742 0.3847 0.9742 0.9870
No log 12.2222 220 0.9624 0.3805 0.9624 0.9810
No log 12.3333 222 0.8979 0.4338 0.8979 0.9476
No log 12.4444 224 0.8765 0.5137 0.8765 0.9362
No log 12.5556 226 0.8549 0.5426 0.8549 0.9246
No log 12.6667 228 0.8465 0.4368 0.8465 0.9200
No log 12.7778 230 0.8783 0.4666 0.8783 0.9372
No log 12.8889 232 0.8706 0.4248 0.8706 0.9331
No log 13.0 234 0.8398 0.4590 0.8398 0.9164
No log 13.1111 236 0.8234 0.5821 0.8234 0.9074
No log 13.2222 238 0.8543 0.5239 0.8543 0.9243
No log 13.3333 240 0.8452 0.5477 0.8452 0.9194
No log 13.4444 242 0.8189 0.6107 0.8189 0.9049
No log 13.5556 244 0.8171 0.6131 0.8171 0.9039
No log 13.6667 246 0.8279 0.5835 0.8279 0.9099
No log 13.7778 248 0.8408 0.5658 0.8408 0.9170
No log 13.8889 250 0.8633 0.5361 0.8633 0.9291
No log 14.0 252 0.9154 0.4672 0.9154 0.9567
No log 14.1111 254 0.9613 0.4919 0.9613 0.9805
No log 14.2222 256 0.9748 0.4431 0.9748 0.9873
No log 14.3333 258 0.9907 0.4191 0.9907 0.9953
No log 14.4444 260 0.9831 0.4156 0.9831 0.9915
No log 14.5556 262 0.9903 0.4156 0.9903 0.9951
No log 14.6667 264 0.9578 0.4247 0.9578 0.9787
No log 14.7778 266 0.9598 0.3956 0.9598 0.9797
No log 14.8889 268 0.9541 0.4549 0.9541 0.9768
No log 15.0 270 0.9730 0.4247 0.9730 0.9864
No log 15.1111 272 0.9297 0.4763 0.9297 0.9642
No log 15.2222 274 0.8785 0.4741 0.8785 0.9373
No log 15.3333 276 0.8695 0.4305 0.8695 0.9325
No log 15.4444 278 0.9167 0.3534 0.9167 0.9575
No log 15.5556 280 0.8861 0.4155 0.8861 0.9413
No log 15.6667 282 0.8361 0.4397 0.8361 0.9144
No log 15.7778 284 0.9007 0.4613 0.9007 0.9491
No log 15.8889 286 0.9848 0.3958 0.9848 0.9924
No log 16.0 288 0.9544 0.4598 0.9544 0.9770
No log 16.1111 290 0.8467 0.5374 0.8467 0.9202
No log 16.2222 292 0.7954 0.5470 0.7954 0.8918
No log 16.3333 294 0.8342 0.4703 0.8342 0.9134
No log 16.4444 296 0.8790 0.4613 0.8790 0.9376
No log 16.5556 298 0.8922 0.5122 0.8922 0.9446
No log 16.6667 300 0.9233 0.4919 0.9233 0.9609
No log 16.7778 302 0.9744 0.4919 0.9744 0.9871
No log 16.8889 304 1.0808 0.4514 1.0808 1.0396
No log 17.0 306 1.2888 0.3862 1.2888 1.1352
No log 17.1111 308 1.3538 0.3248 1.3538 1.1635
No log 17.2222 310 1.1886 0.4247 1.1886 1.0902
No log 17.3333 312 0.9956 0.4518 0.9956 0.9978
No log 17.4444 314 0.8824 0.4139 0.8824 0.9393
No log 17.5556 316 0.8540 0.4450 0.8540 0.9241
No log 17.6667 318 0.8559 0.4865 0.8559 0.9252
No log 17.7778 320 0.8293 0.5553 0.8293 0.9107
No log 17.8889 322 0.8175 0.6011 0.8175 0.9042
No log 18.0 324 0.8181 0.5320 0.8181 0.9045
No log 18.1111 326 0.8355 0.4944 0.8355 0.9140
No log 18.2222 328 0.8414 0.4803 0.8414 0.9173
No log 18.3333 330 0.8554 0.4364 0.8554 0.9249
No log 18.4444 332 0.8908 0.3952 0.8908 0.9438
No log 18.5556 334 0.9401 0.4211 0.9401 0.9696
No log 18.6667 336 0.9890 0.4104 0.9890 0.9945
No log 18.7778 338 1.0093 0.4104 1.0093 1.0046
No log 18.8889 340 0.9778 0.4211 0.9778 0.9889
No log 19.0 342 0.9881 0.3902 0.9881 0.9941
No log 19.1111 344 1.1023 0.3770 1.1023 1.0499
No log 19.2222 346 1.1451 0.3805 1.1451 1.0701
No log 19.3333 348 1.0314 0.3939 1.0314 1.0156
No log 19.4444 350 0.9322 0.4211 0.9322 0.9655
No log 19.5556 352 0.8769 0.4521 0.8769 0.9365
No log 19.6667 354 0.8650 0.4140 0.8650 0.9301
No log 19.7778 356 0.8337 0.4359 0.8337 0.9131
No log 19.8889 358 0.8153 0.4260 0.8153 0.9030
No log 20.0 360 0.8072 0.4696 0.8072 0.8984
No log 20.1111 362 0.8046 0.5107 0.8046 0.8970
No log 20.2222 364 0.8006 0.5107 0.8006 0.8948
No log 20.3333 366 0.8019 0.5634 0.8019 0.8955
No log 20.4444 368 0.8053 0.5376 0.8053 0.8974
No log 20.5556 370 0.8229 0.4961 0.8229 0.9072
No log 20.6667 372 0.8507 0.4338 0.8507 0.9223
No log 20.7778 374 0.8700 0.4369 0.8700 0.9327
No log 20.8889 376 0.8509 0.4946 0.8509 0.9224
No log 21.0 378 0.8429 0.4946 0.8429 0.9181
No log 21.1111 380 0.8400 0.5024 0.8400 0.9165
No log 21.2222 382 0.8447 0.5076 0.8447 0.9191
No log 21.3333 384 0.8468 0.4946 0.8468 0.9202
No log 21.4444 386 0.8677 0.4725 0.8677 0.9315
No log 21.5556 388 0.8714 0.4947 0.8714 0.9335
No log 21.6667 390 0.8370 0.5291 0.8370 0.9149
No log 21.7778 392 0.8405 0.5223 0.8405 0.9168
No log 21.8889 394 0.8668 0.5504 0.8668 0.9310
No log 22.0 396 0.8460 0.5580 0.8460 0.9198
No log 22.1111 398 0.8277 0.4521 0.8277 0.9098
No log 22.2222 400 0.8181 0.4617 0.8181 0.9045
No log 22.3333 402 0.8409 0.5392 0.8409 0.9170
No log 22.4444 404 0.8543 0.5014 0.8543 0.9243
No log 22.5556 406 0.8626 0.4503 0.8626 0.9288
No log 22.6667 408 0.9168 0.4696 0.9168 0.9575
No log 22.7778 410 0.9212 0.4903 0.9212 0.9598
No log 22.8889 412 0.8534 0.4640 0.8534 0.9238
No log 23.0 414 0.8197 0.5181 0.8197 0.9054
No log 23.1111 416 0.8329 0.4884 0.8329 0.9126
No log 23.2222 418 0.8457 0.4938 0.8457 0.9196
No log 23.3333 420 0.8658 0.4814 0.8658 0.9305
No log 23.4444 422 0.8198 0.5880 0.8198 0.9054
No log 23.5556 424 0.7685 0.5708 0.7685 0.8766
No log 23.6667 426 0.7812 0.5348 0.7812 0.8838
No log 23.7778 428 0.8056 0.4404 0.8056 0.8976
No log 23.8889 430 0.7861 0.5226 0.7861 0.8866
No log 24.0 432 0.7976 0.5577 0.7976 0.8931
No log 24.1111 434 0.8493 0.4998 0.8493 0.9216
No log 24.2222 436 0.8361 0.5142 0.8361 0.9144
No log 24.3333 438 0.7808 0.5796 0.7808 0.8836
No log 24.4444 440 0.7840 0.6241 0.7840 0.8854
No log 24.5556 442 0.8476 0.5020 0.8476 0.9206
No log 24.6667 444 0.8635 0.4736 0.8635 0.9292
No log 24.7778 446 0.8227 0.5592 0.8227 0.9070
No log 24.8889 448 0.7778 0.6035 0.7778 0.8820
No log 25.0 450 0.8296 0.5250 0.8296 0.9108
No log 25.1111 452 0.8888 0.5380 0.8888 0.9428
No log 25.2222 454 0.8764 0.5385 0.8764 0.9362
No log 25.3333 456 0.8352 0.5770 0.8352 0.9139
No log 25.4444 458 0.8355 0.5420 0.8355 0.9141
No log 25.5556 460 0.8748 0.4672 0.8748 0.9353
No log 25.6667 462 0.8873 0.4854 0.8873 0.9420
No log 25.7778 464 0.8779 0.4615 0.8779 0.9369
No log 25.8889 466 0.8750 0.4879 0.8750 0.9354
No log 26.0 468 0.8798 0.5175 0.8798 0.9380
No log 26.1111 470 0.8940 0.4931 0.8940 0.9455
No log 26.2222 472 0.8747 0.5322 0.8747 0.9353
No log 26.3333 474 0.8517 0.6021 0.8517 0.9229
No log 26.4444 476 0.8424 0.6011 0.8424 0.9178
No log 26.5556 478 0.8469 0.5364 0.8469 0.9203
No log 26.6667 480 0.8661 0.4947 0.8661 0.9306
No log 26.7778 482 0.8847 0.4521 0.8847 0.9406
No log 26.8889 484 0.8986 0.4424 0.8986 0.9479
No log 27.0 486 0.9004 0.4331 0.9004 0.9489
No log 27.1111 488 0.8957 0.4331 0.8957 0.9464
No log 27.2222 490 0.8995 0.4331 0.8995 0.9484
No log 27.3333 492 0.9008 0.3728 0.9008 0.9491
No log 27.4444 494 0.9068 0.4476 0.9068 0.9523
No log 27.5556 496 0.9113 0.4476 0.9113 0.9546
No log 27.6667 498 0.8999 0.4219 0.8999 0.9486
0.2916 27.7778 500 0.9018 0.4234 0.9018 0.9496
0.2916 27.8889 502 0.9203 0.4042 0.9203 0.9593
0.2916 28.0 504 0.9271 0.3989 0.9271 0.9629
0.2916 28.1111 506 0.9232 0.4082 0.9232 0.9608
0.2916 28.2222 508 0.9002 0.3965 0.9002 0.9488
0.2916 28.3333 510 0.9113 0.4695 0.9113 0.9546
0.2916 28.4444 512 0.9233 0.4603 0.9233 0.9609
0.2916 28.5556 514 0.9087 0.4278 0.9087 0.9532
0.2916 28.6667 516 0.8896 0.3979 0.8896 0.9432
0.2916 28.7778 518 0.8781 0.4292 0.8781 0.9371
0.2916 28.8889 520 0.8763 0.4292 0.8763 0.9361
0.2916 29.0 522 0.8776 0.4385 0.8776 0.9368
0.2916 29.1111 524 0.8917 0.5119 0.8917 0.9443
0.2916 29.2222 526 0.8956 0.5119 0.8956 0.9464
0.2916 29.3333 528 0.9015 0.5023 0.9015 0.9495
0.2916 29.4444 530 0.9475 0.5312 0.9475 0.9734
0.2916 29.5556 532 1.0980 0.3949 1.0980 1.0479
0.2916 29.6667 534 1.3278 0.4136 1.3278 1.1523
0.2916 29.7778 536 1.3573 0.3885 1.3573 1.1650
0.2916 29.8889 538 1.2258 0.3715 1.2258 1.1072
0.2916 30.0 540 1.0508 0.3956 1.0508 1.0251
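
The rows above are whitespace-separated, with "No log" standing in for the training loss until it is first recorded (step 500 in this table). The following sketch parses rows in that layout and picks out the checkpoint with the highest Qwk and the one with the lowest validation loss. The function and field names are hypothetical, and the four sample rows are copied from the table for illustration only; the same approach could be applied to the full table.

```python
def parse_row(line):
    """Parse one results row into a dict (hypothetical helper)."""
    parts = line.split()
    if parts[:2] == ["No", "log"]:
        # Training loss had not been logged yet at this step.
        train_loss = None
        rest = parts[2:]
    else:
        train_loss = float(parts[0])
        rest = parts[1:]
    epoch, step, val_loss, qwk, mse, rmse = rest
    return {
        "train_loss": train_loss,
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


# Four rows copied verbatim from the table above.
SAMPLE_ROWS = [
    "No log 0.1111 2 4.7995 -0.0104 4.7995 2.1908",
    "No log 11.3333 204 0.7557 0.5659 0.7557 0.8693",
    "No log 24.4444 440 0.7840 0.6241 0.7840 0.8854",
    "0.2916 30.0 540 1.0508 0.3956 1.0508 1.0251",
]

rows = [parse_row(line) for line in SAMPLE_ROWS]
best_by_qwk = max(rows, key=lambda r: r["qwk"])
best_by_loss = min(rows, key=lambda r: r["val_loss"])
```

A comparison like this shows that the final row is not the strongest checkpoint in the log: for instance, step 440 records a Qwk of 0.6241, higher than the 0.3956 reported for the final row at step 540.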

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size: 0.1B params (Safetensors, tensor type F32)