ArabicNewSplits7_FineTuningAraBERT_run3_AugV5_k9_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified in this card. It achieves the following results on the evaluation set:

  • Loss: 0.6445
  • Qwk: 0.4404
  • Mse: 0.6445
  • Rmse: 0.8028
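The card does not define these metrics. The sketch below assumes that Qwk stands for quadratic weighted kappa computed on integer scores, and that Rmse is the square root of Mse. The identical Loss and Mse values suggest, but do not confirm, that the model was trained with a mean-squared-error objective. The function names and the toy labels are illustrative and are not taken from the training code.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def quadratic_weighted_kappa(y_true, y_pred, num_labels):
    """Cohen's kappa with quadratic weights for integer labels in [0, num_labels)."""
    n = len(y_true)
    # Observed confusion matrix: rows are true labels, columns are predictions.
    observed = [[0] * num_labels for _ in range(num_labels)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1
    true_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(num_labels))
                 for j in range(num_labels)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(num_labels):
        for j in range(num_labels):
            weight = (i - j) ** 2 / (num_labels - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count if labels and predictions were independent.
            weighted_expected += weight * true_hist[i] * pred_hist[j] / n
    if weighted_expected == 0:
        # Degenerate case: every label and prediction is the same single class.
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


# Toy data (hypothetical, not from the evaluation set).
toy_true = [0, 1, 2, 2]
toy_pred = [0, 1, 1, 2]
toy_mse = mse(toy_true, toy_pred)
toy_rmse = math.sqrt(toy_mse)
toy_qwk = quadratic_weighted_kappa(toy_true, toy_pred, num_labels=3)

# Consistency check on the reported numbers: sqrt(Mse) rounds to the reported Rmse.
reported_rmse = round(math.sqrt(0.6445), 4)
```

With a regression head, the continuous predictions would have to be rounded to integer scores before computing kappa. Whether the original evaluation did this is not stated in the card.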

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
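As a rough illustration of how these settings interact, the sketch below collects the listed hyperparameters in a plain dictionary and computes the learning rate under a linear decay schedule. Two values are assumptions rather than facts from the card: the number of warmup steps (taken to be 0, since none is listed) and the steps per epoch (24, inferred from the results table, where step 24 coincides with epoch 1.0). The helper name `linear_lr` is hypothetical.

```python
# Hyperparameters as listed in this card.
HYPERPARAMETERS = {
    "learning_rate": 2e-05,
    "train_batch_size": 8,
    "eval_batch_size": 8,
    "seed": 42,
    "adam_betas": (0.9, 0.999),
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_epochs": 100,
}

# Assumptions, not stated in the card.
STEPS_PER_EPOCH = 24  # inferred: step 24 is logged at epoch 1.0
WARMUP_STEPS = 0      # no warmup is listed

TOTAL_STEPS = STEPS_PER_EPOCH * HYPERPARAMETERS["num_epochs"]


def linear_lr(step, base_lr=HYPERPARAMETERS["learning_rate"],
              total_steps=TOTAL_STEPS, warmup_steps=WARMUP_STEPS):
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the last step that appears in the results table.
lr_at_last_logged_step = linear_lr(554)
```

Under these assumptions the schedule spans 2400 optimizer steps, so the learning rate at the last logged step (554) would still be well above zero.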

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0833 2 2.3861 -0.0262 2.3861 1.5447
No log 0.1667 4 1.0688 0.2508 1.0688 1.0338
No log 0.25 6 1.2133 -0.1162 1.2133 1.1015
No log 0.3333 8 1.5554 -0.1924 1.5554 1.2471
No log 0.4167 10 1.1353 -0.0622 1.1353 1.0655
No log 0.5 12 0.9150 0.0 0.9150 0.9566
No log 0.5833 14 0.9082 0.0541 0.9082 0.9530
No log 0.6667 16 0.9328 0.1416 0.9328 0.9658
No log 0.75 18 1.0116 0.2412 1.0116 1.0058
No log 0.8333 20 0.9747 0.1277 0.9747 0.9873
No log 0.9167 22 0.8724 0.0 0.8724 0.9340
No log 1.0 24 0.9295 0.0 0.9295 0.9641
No log 1.0833 26 0.9670 0.1345 0.9670 0.9834
No log 1.1667 28 1.0301 0.2533 1.0301 1.0149
No log 1.25 30 0.9275 0.2408 0.9275 0.9631
No log 1.3333 32 0.7627 0.0 0.7627 0.8733
No log 1.4167 34 0.7109 0.0 0.7109 0.8432
No log 1.5 36 0.7444 0.0 0.7444 0.8628
No log 1.5833 38 0.7567 0.0 0.7567 0.8699
No log 1.6667 40 0.7563 0.0481 0.7563 0.8697
No log 1.75 42 0.7223 -0.0027 0.7223 0.8499
No log 1.8333 44 0.7282 0.0444 0.7282 0.8533
No log 1.9167 46 0.7845 0.2132 0.7845 0.8857
No log 2.0 48 0.8331 0.3131 0.8331 0.9128
No log 2.0833 50 0.8155 0.2736 0.8155 0.9030
No log 2.1667 52 0.7107 0.1729 0.7107 0.8430
No log 2.25 54 0.6970 0.1365 0.6970 0.8349
No log 2.3333 56 0.6822 0.1321 0.6822 0.8260
No log 2.4167 58 0.6988 0.2118 0.6988 0.8360
No log 2.5 60 0.7431 0.2087 0.7431 0.8620
No log 2.5833 62 0.8569 0.3192 0.8569 0.9257
No log 2.6667 64 0.8842 0.3601 0.8842 0.9403
No log 2.75 66 0.7329 0.2992 0.7329 0.8561
No log 2.8333 68 0.6165 0.3862 0.6165 0.7852
No log 2.9167 70 0.7260 0.3637 0.7260 0.8521
No log 3.0 72 0.7028 0.3637 0.7028 0.8383
No log 3.0833 74 0.6228 0.2744 0.6228 0.7892
No log 3.1667 76 0.9057 0.3892 0.9057 0.9517
No log 3.25 78 0.9123 0.3892 0.9123 0.9552
No log 3.3333 80 0.6551 0.3135 0.6551 0.8094
No log 3.4167 82 0.6258 0.4103 0.6258 0.7911
No log 3.5 84 0.6210 0.3864 0.6210 0.7880
No log 3.5833 86 0.6178 0.3581 0.6178 0.7860
No log 3.6667 88 0.6578 0.3492 0.6578 0.8110
No log 3.75 90 0.6617 0.2973 0.6617 0.8135
No log 3.8333 92 0.7579 0.3384 0.7579 0.8706
No log 3.9167 94 0.9451 0.3538 0.9451 0.9721
No log 4.0 96 0.8619 0.4064 0.8619 0.9284
No log 4.0833 98 0.9659 0.3274 0.9659 0.9828
No log 4.1667 100 1.2205 0.3257 1.2205 1.1048
No log 4.25 102 0.8914 0.3377 0.8914 0.9441
No log 4.3333 104 0.7828 0.3343 0.7828 0.8848
No log 4.4167 106 0.6971 0.4640 0.6971 0.8349
No log 4.5 108 0.6775 0.4137 0.6775 0.8231
No log 4.5833 110 0.7189 0.3746 0.7189 0.8479
No log 4.6667 112 1.0208 0.3460 1.0208 1.0103
No log 4.75 114 0.9867 0.3517 0.9867 0.9933
No log 4.8333 116 0.7478 0.3606 0.7478 0.8648
No log 4.9167 118 0.6217 0.3713 0.6217 0.7885
No log 5.0 120 0.6146 0.4229 0.6146 0.7840
No log 5.0833 122 0.6523 0.3032 0.6523 0.8076
No log 5.1667 124 0.7586 0.4154 0.7586 0.8710
No log 5.25 126 0.7891 0.3287 0.7891 0.8883
No log 5.3333 128 0.8266 0.3228 0.8266 0.9092
No log 5.4167 130 0.7153 0.3473 0.7153 0.8457
No log 5.5 132 0.6211 0.3950 0.6211 0.7881
No log 5.5833 134 0.6230 0.3950 0.6230 0.7893
No log 5.6667 136 0.6129 0.4919 0.6129 0.7829
No log 5.75 138 0.6446 0.4764 0.6446 0.8029
No log 5.8333 140 0.5737 0.4763 0.5737 0.7575
No log 5.9167 142 0.5839 0.4349 0.5839 0.7642
No log 6.0 144 0.5840 0.4486 0.5840 0.7642
No log 6.0833 146 0.5588 0.4934 0.5588 0.7475
No log 6.1667 148 0.5758 0.4234 0.5758 0.7588
No log 6.25 150 0.5744 0.4314 0.5744 0.7579
No log 6.3333 152 0.5929 0.4762 0.5929 0.7700
No log 6.4167 154 0.5972 0.4747 0.5972 0.7728
No log 6.5 156 0.6933 0.3746 0.6933 0.8326
No log 6.5833 158 0.6783 0.3494 0.6783 0.8236
No log 6.6667 160 0.6313 0.3865 0.6313 0.7945
No log 6.75 162 0.6417 0.3475 0.6417 0.8011
No log 6.8333 164 0.6380 0.4482 0.6380 0.7987
No log 6.9167 166 0.6554 0.3833 0.6554 0.8096
No log 7.0 168 0.6561 0.3994 0.6561 0.8100
No log 7.0833 170 0.6338 0.4423 0.6338 0.7961
No log 7.1667 172 0.6704 0.4186 0.6704 0.8188
No log 7.25 174 0.7031 0.4464 0.7031 0.8385
No log 7.3333 176 0.6456 0.4044 0.6456 0.8035
No log 7.4167 178 0.6423 0.4362 0.6423 0.8015
No log 7.5 180 0.6253 0.3859 0.6253 0.7908
No log 7.5833 182 0.6247 0.4724 0.6247 0.7904
No log 7.6667 184 0.6985 0.4502 0.6985 0.8358
No log 7.75 186 0.7874 0.4080 0.7874 0.8873
No log 7.8333 188 0.7232 0.4562 0.7232 0.8504
No log 7.9167 190 0.5980 0.5115 0.5980 0.7733
No log 8.0 192 0.5940 0.4283 0.5940 0.7707
No log 8.0833 194 0.6622 0.5251 0.6622 0.8138
No log 8.1667 196 0.6828 0.5251 0.6828 0.8263
No log 8.25 198 0.6211 0.4892 0.6211 0.7881
No log 8.3333 200 0.5783 0.4753 0.5783 0.7605
No log 8.4167 202 0.5689 0.5201 0.5689 0.7543
No log 8.5 204 0.5808 0.4370 0.5808 0.7621
No log 8.5833 206 0.7075 0.4076 0.7075 0.8411
No log 8.6667 208 0.8154 0.3151 0.8154 0.9030
No log 8.75 210 0.7373 0.3889 0.7373 0.8587
No log 8.8333 212 0.5944 0.4315 0.5944 0.7710
No log 8.9167 214 0.6189 0.4749 0.6189 0.7867
No log 9.0 216 0.8305 0.3657 0.8305 0.9113
No log 9.0833 218 0.9161 0.3963 0.9161 0.9571
No log 9.1667 220 0.7721 0.3665 0.7721 0.8787
No log 9.25 222 0.6371 0.4375 0.6371 0.7982
No log 9.3333 224 0.6205 0.3787 0.6205 0.7877
No log 9.4167 226 0.6123 0.3787 0.6123 0.7825
No log 9.5 228 0.6003 0.3738 0.6003 0.7748
No log 9.5833 230 0.5966 0.4547 0.5966 0.7724
No log 9.6667 232 0.5888 0.4547 0.5888 0.7673
No log 9.75 234 0.5939 0.5079 0.5939 0.7707
No log 9.8333 236 0.5920 0.4516 0.5920 0.7694
No log 9.9167 238 0.6036 0.4942 0.6036 0.7769
No log 10.0 240 0.6253 0.4895 0.6253 0.7908
No log 10.0833 242 0.6083 0.4655 0.6083 0.7799
No log 10.1667 244 0.5911 0.4314 0.5911 0.7689
No log 10.25 246 0.6084 0.4059 0.6084 0.7800
No log 10.3333 248 0.5929 0.4534 0.5929 0.7700
No log 10.4167 250 0.6150 0.3746 0.6150 0.7842
No log 10.5 252 0.6702 0.3918 0.6702 0.8187
No log 10.5833 254 0.7056 0.4030 0.7056 0.8400
No log 10.6667 256 0.6857 0.4030 0.6857 0.8281
No log 10.75 258 0.6241 0.3789 0.6241 0.7900
No log 10.8333 260 0.6103 0.2652 0.6103 0.7812
No log 10.9167 262 0.6379 0.3712 0.6379 0.7987
No log 11.0 264 0.7079 0.3918 0.7079 0.8414
No log 11.0833 266 0.6914 0.4067 0.6914 0.8315
No log 11.1667 268 0.6254 0.3789 0.6254 0.7908
No log 11.25 270 0.6153 0.3196 0.6153 0.7844
No log 11.3333 272 0.6273 0.3942 0.6273 0.7920
No log 11.4167 274 0.6253 0.3942 0.6253 0.7908
No log 11.5 276 0.6112 0.3498 0.6112 0.7818
No log 11.5833 278 0.5880 0.4970 0.5880 0.7668
No log 11.6667 280 0.5745 0.5208 0.5745 0.7579
No log 11.75 282 0.5690 0.4402 0.5690 0.7543
No log 11.8333 284 0.5765 0.5397 0.5765 0.7593
No log 11.9167 286 0.5803 0.5397 0.5803 0.7618
No log 12.0 288 0.5700 0.5208 0.5700 0.7550
No log 12.0833 290 0.5715 0.4949 0.5715 0.7560
No log 12.1667 292 0.5741 0.4949 0.5741 0.7577
No log 12.25 294 0.5827 0.5386 0.5827 0.7633
No log 12.3333 296 0.5889 0.4526 0.5889 0.7674
No log 12.4167 298 0.6442 0.3196 0.6442 0.8026
No log 12.5 300 0.6887 0.3963 0.6887 0.8299
No log 12.5833 302 0.6504 0.3544 0.6504 0.8065
No log 12.6667 304 0.6934 0.3723 0.6934 0.8327
No log 12.75 306 0.8008 0.4366 0.8008 0.8949
No log 12.8333 308 0.7390 0.3586 0.7390 0.8596
No log 12.9167 310 0.6421 0.3723 0.6421 0.8013
No log 13.0 312 0.5972 0.3387 0.5972 0.7728
No log 13.0833 314 0.5953 0.3701 0.5953 0.7716
No log 13.1667 316 0.6086 0.3545 0.6086 0.7801
No log 13.25 318 0.6444 0.4272 0.6444 0.8028
No log 13.3333 320 0.6416 0.4409 0.6416 0.8010
No log 13.4167 322 0.5895 0.3996 0.5895 0.7678
No log 13.5 324 0.5531 0.4681 0.5531 0.7437
No log 13.5833 326 0.5540 0.4660 0.5540 0.7443
No log 13.6667 328 0.5840 0.4409 0.5840 0.7642
No log 13.75 330 0.5991 0.4664 0.5991 0.7740
No log 13.8333 332 0.6170 0.4329 0.6170 0.7855
No log 13.9167 334 0.6538 0.4329 0.6538 0.8086
No log 14.0 336 0.6634 0.4067 0.6634 0.8145
No log 14.0833 338 0.6118 0.4067 0.6118 0.7822
No log 14.1667 340 0.5684 0.4639 0.5684 0.7539
No log 14.25 342 0.5613 0.5003 0.5613 0.7492
No log 14.3333 344 0.5615 0.5550 0.5615 0.7493
No log 14.4167 346 0.5801 0.4375 0.5801 0.7617
No log 14.5 348 0.6238 0.4089 0.6238 0.7898
No log 14.5833 350 0.5995 0.4414 0.5995 0.7743
No log 14.6667 352 0.5758 0.4524 0.5758 0.7588
No log 14.75 354 0.5621 0.5749 0.5621 0.7497
No log 14.8333 356 0.5582 0.5784 0.5582 0.7471
No log 14.9167 358 0.5866 0.4895 0.5866 0.7659
No log 15.0 360 0.6042 0.5104 0.6042 0.7773
No log 15.0833 362 0.5689 0.5141 0.5689 0.7543
No log 15.1667 364 0.5534 0.5250 0.5534 0.7439
No log 15.25 366 0.5606 0.4847 0.5606 0.7488
No log 15.3333 368 0.6238 0.4371 0.6238 0.7898
No log 15.4167 370 0.6620 0.3918 0.6620 0.8136
No log 15.5 372 0.6138 0.3723 0.6138 0.7835
No log 15.5833 374 0.5567 0.4576 0.5567 0.7461
No log 15.6667 376 0.5442 0.5326 0.5442 0.7377
No log 15.75 378 0.5467 0.4576 0.5467 0.7394
No log 15.8333 380 0.5747 0.4618 0.5747 0.7581
No log 15.9167 382 0.6159 0.4224 0.6159 0.7848
No log 16.0 384 0.5975 0.4076 0.5975 0.7730
No log 16.0833 386 0.5841 0.3763 0.5841 0.7643
No log 16.1667 388 0.5764 0.4124 0.5764 0.7592
No log 16.25 390 0.5764 0.4291 0.5764 0.7592
No log 16.3333 392 0.5972 0.3814 0.5972 0.7728
No log 16.4167 394 0.6119 0.3737 0.6119 0.7823
No log 16.5 396 0.6559 0.4067 0.6559 0.8099
No log 16.5833 398 0.6504 0.3894 0.6504 0.8065
No log 16.6667 400 0.6190 0.4330 0.6190 0.7868
No log 16.75 402 0.6098 0.3809 0.6098 0.7809
No log 16.8333 404 0.6216 0.3834 0.6216 0.7884
No log 16.9167 406 0.6170 0.3153 0.6170 0.7855
No log 17.0 408 0.6266 0.3839 0.6266 0.7916
No log 17.0833 410 0.7146 0.3630 0.7146 0.8454
No log 17.1667 412 0.7542 0.3433 0.7542 0.8685
No log 17.25 414 0.6881 0.4251 0.6881 0.8295
No log 17.3333 416 0.5863 0.4354 0.5863 0.7657
No log 17.4167 418 0.5586 0.4111 0.5586 0.7474
No log 17.5 420 0.5516 0.4729 0.5516 0.7427
No log 17.5833 422 0.5540 0.4729 0.5540 0.7443
No log 17.6667 424 0.5486 0.4637 0.5486 0.7407
No log 17.75 426 0.5424 0.4561 0.5424 0.7365
No log 17.8333 428 0.5477 0.4817 0.5477 0.7400
No log 17.9167 430 0.5465 0.4817 0.5465 0.7392
No log 18.0 432 0.5489 0.4677 0.5489 0.7409
No log 18.0833 434 0.5394 0.5133 0.5394 0.7344
No log 18.1667 436 0.5395 0.4657 0.5395 0.7345
No log 18.25 438 0.5372 0.4742 0.5372 0.7329
No log 18.3333 440 0.5369 0.4923 0.5369 0.7327
No log 18.4167 442 0.5362 0.4878 0.5362 0.7322
No log 18.5 444 0.5573 0.4795 0.5573 0.7465
No log 18.5833 446 0.5787 0.4243 0.5787 0.7607
No log 18.6667 448 0.5676 0.4027 0.5676 0.7534
No log 18.75 450 0.5677 0.4136 0.5677 0.7535
No log 18.8333 452 0.5715 0.4136 0.5715 0.7560
No log 18.9167 454 0.5776 0.3523 0.5776 0.7600
No log 19.0 456 0.6133 0.4076 0.6133 0.7831
No log 19.0833 458 0.6795 0.4307 0.6795 0.8243
No log 19.1667 460 0.7197 0.4307 0.7197 0.8484
No log 19.25 462 0.6925 0.4307 0.6925 0.8321
No log 19.3333 464 0.6562 0.4387 0.6562 0.8101
No log 19.4167 466 0.6074 0.4330 0.6074 0.7793
No log 19.5 468 0.5994 0.4076 0.5994 0.7742
No log 19.5833 470 0.6396 0.4470 0.6396 0.7998
No log 19.6667 472 0.6436 0.4387 0.6436 0.8022
No log 19.75 474 0.6064 0.3712 0.6064 0.7787
No log 19.8333 476 0.5614 0.4437 0.5614 0.7493
No log 19.9167 478 0.5333 0.4659 0.5333 0.7303
No log 20.0 480 0.5281 0.5065 0.5281 0.7267
No log 20.0833 482 0.5301 0.5305 0.5301 0.7281
No log 20.1667 484 0.5449 0.4659 0.5449 0.7382
No log 20.25 486 0.5657 0.3814 0.5657 0.7521
No log 20.3333 488 0.5985 0.4491 0.5985 0.7737
No log 20.4167 490 0.6211 0.4470 0.6211 0.7881
No log 20.5 492 0.5820 0.4330 0.5820 0.7629
No log 20.5833 494 0.5611 0.4044 0.5611 0.7491
No log 20.6667 496 0.5342 0.5440 0.5342 0.7309
No log 20.75 498 0.5250 0.5750 0.5250 0.7246
0.3138 20.8333 500 0.5246 0.5656 0.5246 0.7243
0.3138 20.9167 502 0.5340 0.5784 0.5340 0.7307
0.3138 21.0 504 0.5233 0.6267 0.5233 0.7234
0.3138 21.0833 506 0.5194 0.5915 0.5194 0.7207
0.3138 21.1667 508 0.5251 0.5554 0.5251 0.7247
0.3138 21.25 510 0.5455 0.5014 0.5455 0.7386
0.3138 21.3333 512 0.5444 0.5014 0.5444 0.7379
0.3138 21.4167 514 0.5318 0.5061 0.5318 0.7293
0.3138 21.5 516 0.5136 0.6068 0.5136 0.7167
0.3138 21.5833 518 0.5483 0.5404 0.5483 0.7405
0.3138 21.6667 520 0.5725 0.5063 0.5725 0.7566
0.3138 21.75 522 0.5446 0.4769 0.5446 0.7380
0.3138 21.8333 524 0.5163 0.5869 0.5163 0.7186
0.3138 21.9167 526 0.5242 0.5414 0.5242 0.7240
0.3138 22.0 528 0.5297 0.5414 0.5297 0.7278
0.3138 22.0833 530 0.5193 0.6154 0.5193 0.7206
0.3138 22.1667 532 0.5360 0.4808 0.5360 0.7321
0.3138 22.25 534 0.5916 0.4646 0.5916 0.7692
0.3138 22.3333 536 0.5983 0.4646 0.5983 0.7735
0.3138 22.4167 538 0.5558 0.5373 0.5558 0.7455
0.3138 22.5 540 0.5263 0.5750 0.5263 0.7255
0.3138 22.5833 542 0.5377 0.5479 0.5377 0.7333
0.3138 22.6667 544 0.5346 0.5750 0.5346 0.7312
0.3138 22.75 546 0.5384 0.5522 0.5384 0.7337
0.3138 22.8333 548 0.5711 0.4100 0.5711 0.7557
0.3138 22.9167 550 0.6296 0.4404 0.6296 0.7935
0.3138 23.0 552 0.6685 0.4404 0.6685 0.8176
0.3138 23.0833 554 0.6445 0.4404 0.6445 0.8028
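Each row of the table above holds seven fields: training loss, epoch, step, validation loss, Qwk, Mse, and Rmse. The training loss reads "No log" until step 500. The following sketch parses rows in this layout and compares checkpoints by Qwk and by validation loss. The excerpt is a handful of rows copied from the table, and the helper `parse_row` is hypothetical.

```python
LOG_EXCERPT = """\
No log 20.75 498 0.5250 0.5750 0.5250 0.7246
0.3138 20.8333 500 0.5246 0.5656 0.5246 0.7243
0.3138 21.0 504 0.5233 0.6267 0.5233 0.7234
0.3138 21.5 516 0.5136 0.6068 0.5136 0.7167
0.3138 22.0833 530 0.5193 0.6154 0.5193 0.7206
0.3138 23.0833 554 0.6445 0.4404 0.6445 0.8028
"""


def parse_row(line):
    """Parse one table row into a dict; the training loss may be 'No log'."""
    tokens = line.split()
    # The last six tokens are always numeric; whatever precedes them is the
    # training-loss field, which is either a number or the two words "No log".
    epoch, step, val_loss, qwk, mse, rmse = tokens[-6:]
    train_field = " ".join(tokens[:-6])
    return {
        "train_loss": None if train_field == "No log" else float(train_field),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


rows = [parse_row(line) for line in LOG_EXCERPT.splitlines() if line.strip()]

# Rank the excerpted checkpoints two ways.
best_by_qwk = max(rows, key=lambda r: r["qwk"])
best_by_val_loss = min(rows, key=lambda r: r["val_loss"])
final_row = rows[-1]
```

Within this excerpt, the row with the highest Qwk (step 504) and the row with the lowest validation loss (step 516) both score better than the final row (step 554), which is the one reported at the top of the card.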

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1