ArabicNewSplits7_FineTuningAraBERT_run3_AugV5_k1_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset was not recorded in the card. It achieves the following results on the evaluation set:

  • Loss: 0.9552
  • Qwk: 0.3771
  • Mse: 0.9552
  • Rmse: 0.9774
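
The metrics above are related to each other: Rmse is the square root of Mse, and Loss equals Mse, which suggests (this is an assumption, the card does not say so) that training minimized a mean-squared-error objective. Qwk is presumably quadratic weighted kappa. A minimal pure-Python sketch of these quantities follows; the function names and the toy labels are hypothetical and are not taken from the training code.

```python
import math
from collections import Counter


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def quadratic_weighted_kappa(y_true, y_pred, num_classes):
    """Cohen's kappa with quadratic weights for integer labels 0..num_classes-1."""
    n = len(y_true)
    observed = Counter(zip(y_true, y_pred))   # joint counts of (true, predicted)
    true_hist = Counter(y_true)               # marginal counts of true labels
    pred_hist = Counter(y_pred)               # marginal counts of predictions
    num = 0.0  # weighted observed disagreement
    den = 0.0  # weighted disagreement expected by chance
    for i in range(num_classes):
        for j in range(num_classes):
            w = (i - j) ** 2 / (num_classes - 1) ** 2
            num += w * observed[(i, j)] / n
            den += w * true_hist[i] * pred_hist[j] / (n * n)
    if den == 0:
        raise ValueError("kappa is undefined when both label sets are one identical class")
    return 1.0 - num / den


# Toy check: perfect agreement gives kappa 1.0, constant predictions give 0.
print(quadratic_weighted_kappa([0, 1, 2, 3], [0, 1, 2, 3], num_classes=4))
# The reported Rmse agrees with sqrt(Mse) up to rounding of the displayed Mse.
print(math.sqrt(0.9552))
```

A Qwk of 0.0, as in some early rows of the training log, is what this formula gives when every prediction falls in the same class.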

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
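
As a rough illustration of what these settings imply, the sketch below computes a linearly decaying learning rate and the training-set size consistent with the log. The figure of 6 optimizer steps per epoch is read off the training results (step 6 at epoch 1.0). Zero warmup steps, decay to zero over the full 100 epochs, a single device, and no gradient accumulation are all assumptions, since the card does not list those settings. The function names are hypothetical.

```python
LEARNING_RATE = 2e-05      # from the hyperparameter list
TRAIN_BATCH_SIZE = 8       # from the hyperparameter list
NUM_EPOCHS = 100           # from the hyperparameter list
STEPS_PER_EPOCH = 6        # read off the training log (step 6 at epoch 1.0)
TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step, warmup_steps=0):
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return LEARNING_RATE * step / max(1, warmup_steps)
    remaining = max(0, TOTAL_STEPS - step)
    return LEARNING_RATE * remaining / max(1, TOTAL_STEPS - warmup_steps)


def train_set_size_bounds(steps_per_epoch, batch_size):
    """Smallest and largest dataset size that yields this many batches per epoch.

    Assumes the last batch of an epoch may be partial (no drop_last).
    """
    return (steps_per_epoch - 1) * batch_size + 1, steps_per_epoch * batch_size


print(TOTAL_STEPS)                                             # planned optimizer steps
print(linear_lr(300))                                          # halfway through the schedule
print(train_set_size_bounds(STEPS_PER_EPOCH, TRAIN_BATCH_SIZE))  # plausible training-set size
```

Under these assumptions the planned run is 600 steps on roughly 41 to 48 training examples; the training log in this card ends at step 536.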

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.3333 2 4.5513 0.0163 4.5513 2.1334
No log 0.6667 4 3.2075 -0.0038 3.2075 1.7909
No log 1.0 6 2.1306 0.1271 2.1306 1.4597
No log 1.3333 8 1.3656 -0.0047 1.3656 1.1686
No log 1.6667 10 1.2944 0.0883 1.2944 1.1377
No log 2.0 12 1.2570 0.0353 1.2570 1.1212
No log 2.3333 14 1.2407 0.0454 1.2407 1.1139
No log 2.6667 16 1.2351 0.0253 1.2351 1.1114
No log 3.0 18 1.2107 0.2360 1.2107 1.1003
No log 3.3333 20 1.2116 0.3679 1.2116 1.1007
No log 3.6667 22 1.3519 0.0 1.3519 1.1627
No log 4.0 24 1.3695 0.0 1.3695 1.1703
No log 4.3333 26 1.2303 0.1076 1.2303 1.1092
No log 4.6667 28 1.1332 0.3394 1.1332 1.0645
No log 5.0 30 1.0975 0.4072 1.0975 1.0476
No log 5.3333 32 1.0632 0.3201 1.0632 1.0311
No log 5.6667 34 1.0349 0.4072 1.0349 1.0173
No log 6.0 36 1.0109 0.4222 1.0109 1.0054
No log 6.3333 38 1.0531 0.3131 1.0531 1.0262
No log 6.6667 40 1.1234 0.1587 1.1234 1.0599
No log 7.0 42 1.0142 0.2984 1.0142 1.0071
No log 7.3333 44 0.9172 0.3627 0.9172 0.9577
No log 7.6667 46 0.9207 0.3627 0.9207 0.9595
No log 8.0 48 0.9292 0.3724 0.9292 0.9639
No log 8.3333 50 0.9050 0.4418 0.9050 0.9513
No log 8.6667 52 0.9494 0.4835 0.9494 0.9744
No log 9.0 54 0.8738 0.5410 0.8738 0.9348
No log 9.3333 56 0.9880 0.4890 0.9880 0.9940
No log 9.6667 58 1.0896 0.4325 1.0896 1.0439
No log 10.0 60 0.9008 0.4565 0.9008 0.9491
No log 10.3333 62 0.9316 0.4948 0.9316 0.9652
No log 10.6667 64 0.9888 0.4694 0.9888 0.9944
No log 11.0 66 0.8798 0.5291 0.8798 0.9380
No log 11.3333 68 0.9552 0.4420 0.9552 0.9773
No log 11.6667 70 1.1247 0.4216 1.1247 1.0605
No log 12.0 72 1.1350 0.4140 1.1350 1.0654
No log 12.3333 74 0.9840 0.4515 0.9840 0.9920
No log 12.6667 76 0.9046 0.4864 0.9046 0.9511
No log 13.0 78 0.9070 0.4864 0.9070 0.9523
No log 13.3333 80 0.8929 0.5011 0.8929 0.9449
No log 13.6667 82 0.8999 0.5324 0.8999 0.9486
No log 14.0 84 0.9354 0.4970 0.9354 0.9672
No log 14.3333 86 0.9531 0.4316 0.9531 0.9763
No log 14.6667 88 0.9018 0.4672 0.9018 0.9497
No log 15.0 90 0.8518 0.5197 0.8518 0.9229
No log 15.3333 92 1.0451 0.4775 1.0451 1.0223
No log 15.6667 94 0.9610 0.4657 0.9610 0.9803
No log 16.0 96 0.8465 0.5431 0.8465 0.9200
No log 16.3333 98 1.0238 0.4862 1.0238 1.0118
No log 16.6667 100 1.0679 0.4440 1.0679 1.0334
No log 17.0 102 0.9278 0.4113 0.9278 0.9632
No log 17.3333 104 0.8603 0.4995 0.8603 0.9275
No log 17.6667 106 0.8593 0.4865 0.8593 0.9270
No log 18.0 108 0.8815 0.4611 0.8815 0.9389
No log 18.3333 110 0.8691 0.4611 0.8691 0.9322
No log 18.6667 112 0.8700 0.4611 0.8700 0.9327
No log 19.0 114 0.8544 0.5283 0.8544 0.9243
No log 19.3333 116 0.8604 0.4995 0.8604 0.9276
No log 19.6667 118 0.9822 0.3958 0.9822 0.9910
No log 20.0 120 1.1397 0.4156 1.1397 1.0676
No log 20.3333 122 1.1775 0.4050 1.1775 1.0851
No log 20.6667 124 1.0273 0.4851 1.0273 1.0135
No log 21.0 126 0.9102 0.4726 0.9102 0.9540
No log 21.3333 128 0.8884 0.4643 0.8884 0.9425
No log 21.6667 130 0.8879 0.4912 0.8879 0.9423
No log 22.0 132 0.8604 0.4962 0.8604 0.9276
No log 22.3333 134 0.8580 0.5329 0.8580 0.9263
No log 22.6667 136 0.9431 0.4166 0.9431 0.9711
No log 23.0 138 1.0388 0.3068 1.0388 1.0192
No log 23.3333 140 1.0089 0.3596 1.0089 1.0045
No log 23.6667 142 0.9271 0.3885 0.9271 0.9629
No log 24.0 144 0.8642 0.4865 0.8642 0.9296
No log 24.3333 146 0.8450 0.5027 0.8450 0.9192
No log 24.6667 148 0.8628 0.5184 0.8628 0.9289
No log 25.0 150 0.9412 0.3990 0.9412 0.9701
No log 25.3333 152 1.0010 0.3758 1.0010 1.0005
No log 25.6667 154 0.9457 0.3986 0.9457 0.9725
No log 26.0 156 0.8901 0.4884 0.8901 0.9434
No log 26.3333 158 0.8875 0.4884 0.8875 0.9421
No log 26.6667 160 0.8789 0.4898 0.8789 0.9375
No log 27.0 162 0.8893 0.4884 0.8893 0.9430
No log 27.3333 164 0.9250 0.3919 0.9250 0.9618
No log 27.6667 166 0.9262 0.4275 0.9262 0.9624
No log 28.0 168 0.9076 0.4527 0.9076 0.9527
No log 28.3333 170 0.9137 0.4430 0.9137 0.9559
No log 28.6667 172 0.9248 0.4430 0.9248 0.9617
No log 29.0 174 0.9553 0.4698 0.9553 0.9774
No log 29.3333 176 0.9401 0.4426 0.9401 0.9696
No log 29.6667 178 0.9387 0.4430 0.9387 0.9688
No log 30.0 180 0.9444 0.4304 0.9444 0.9718
No log 30.3333 182 0.9534 0.4304 0.9534 0.9764
No log 30.6667 184 0.9441 0.4306 0.9441 0.9716
No log 31.0 186 0.9376 0.4308 0.9376 0.9683
No log 31.3333 188 0.9376 0.4311 0.9376 0.9683
No log 31.6667 190 0.9217 0.4311 0.9217 0.9601
No log 32.0 192 0.9137 0.4568 0.9137 0.9559
No log 32.3333 194 0.9027 0.4560 0.9027 0.9501
No log 32.6667 196 0.8937 0.4560 0.8937 0.9454
No log 33.0 198 0.8940 0.4554 0.8940 0.9455
No log 33.3333 200 0.9354 0.4590 0.9354 0.9672
No log 33.6667 202 1.0013 0.4119 1.0013 1.0006
No log 34.0 204 0.9698 0.3986 0.9698 0.9848
No log 34.3333 206 0.9228 0.4144 0.9228 0.9606
No log 34.6667 208 0.8797 0.4898 0.8797 0.9379
No log 35.0 210 0.8679 0.4898 0.8679 0.9316
No log 35.3333 212 0.8859 0.5059 0.8859 0.9412
No log 35.6667 214 0.9201 0.4356 0.9201 0.9592
No log 36.0 216 0.9153 0.4234 0.9153 0.9567
No log 36.3333 218 0.9102 0.4234 0.9102 0.9541
No log 36.6667 220 0.9017 0.4235 0.9017 0.9496
No log 37.0 222 0.8957 0.4235 0.8957 0.9464
No log 37.3333 224 0.9042 0.4235 0.9042 0.9509
No log 37.6667 226 0.9185 0.3986 0.9185 0.9584
No log 38.0 228 0.9544 0.3990 0.9544 0.9769
No log 38.3333 230 0.9500 0.4356 0.9500 0.9747
No log 38.6667 232 0.9020 0.4435 0.9020 0.9497
No log 39.0 234 0.8922 0.4440 0.8922 0.9446
No log 39.3333 236 0.8983 0.4308 0.8983 0.9478
No log 39.6667 238 0.9020 0.4311 0.9020 0.9497
No log 40.0 240 0.8817 0.4705 0.8817 0.9390
No log 40.3333 242 0.9031 0.4962 0.9031 0.9503
No log 40.6667 244 0.9057 0.5438 0.9057 0.9517
No log 41.0 246 0.8763 0.5089 0.8763 0.9361
No log 41.3333 248 0.8861 0.4644 0.8861 0.9413
No log 41.6667 250 0.9835 0.4423 0.9835 0.9917
No log 42.0 252 1.0246 0.3888 1.0246 1.0122
No log 42.3333 254 0.9725 0.4638 0.9725 0.9862
No log 42.6667 256 0.9167 0.4270 0.9167 0.9574
No log 43.0 258 0.8870 0.5011 0.8870 0.9418
No log 43.3333 260 0.8896 0.5027 0.8896 0.9432
No log 43.6667 262 0.9207 0.4366 0.9207 0.9595
No log 44.0 264 0.9438 0.4237 0.9438 0.9715
No log 44.3333 266 0.9532 0.3635 0.9532 0.9763
No log 44.6667 268 0.9285 0.4363 0.9285 0.9636
No log 45.0 270 0.8987 0.5089 0.8987 0.9480
No log 45.3333 272 0.8888 0.5089 0.8888 0.9428
No log 45.6667 274 0.8757 0.5089 0.8757 0.9358
No log 46.0 276 0.8832 0.4840 0.8832 0.9398
No log 46.3333 278 0.9009 0.4714 0.9009 0.9492
No log 46.6667 280 0.8839 0.4714 0.8839 0.9402
No log 47.0 282 0.8697 0.5011 0.8697 0.9326
No log 47.3333 284 0.8718 0.5262 0.8718 0.9337
No log 47.6667 286 0.8811 0.5011 0.8811 0.9387
No log 48.0 288 0.9107 0.4493 0.9107 0.9543
No log 48.3333 290 0.9792 0.3285 0.9792 0.9896
No log 48.6667 292 1.0115 0.3173 1.0115 1.0057
No log 49.0 294 1.0083 0.3173 1.0083 1.0041
No log 49.3333 296 0.9531 0.3584 0.9531 0.9762
No log 49.6667 298 0.9030 0.5155 0.9030 0.9503
No log 50.0 300 0.9378 0.5089 0.9378 0.9684
No log 50.3333 302 0.9585 0.4815 0.9585 0.9790
No log 50.6667 304 0.9339 0.5089 0.9339 0.9664
No log 51.0 306 0.9066 0.5155 0.9066 0.9522
No log 51.3333 308 0.9218 0.4995 0.9218 0.9601
No log 51.6667 310 0.9434 0.3880 0.9434 0.9713
No log 52.0 312 0.9545 0.3602 0.9545 0.9770
No log 52.3333 314 0.9407 0.3590 0.9407 0.9699
No log 52.6667 316 0.9177 0.3874 0.9177 0.9580
No log 53.0 318 0.9033 0.3914 0.9033 0.9504
No log 53.3333 320 0.9053 0.3886 0.9053 0.9515
No log 53.6667 322 0.9359 0.3624 0.9359 0.9674
No log 54.0 324 0.9462 0.3725 0.9462 0.9727
No log 54.3333 326 0.9153 0.3848 0.9153 0.9567
No log 54.6667 328 0.8745 0.4965 0.8745 0.9352
No log 55.0 330 0.8535 0.5213 0.8535 0.9239
No log 55.3333 332 0.8477 0.5458 0.8477 0.9207
No log 55.6667 334 0.8477 0.5458 0.8477 0.9207
No log 56.0 336 0.8627 0.5089 0.8627 0.9288
No log 56.3333 338 0.8928 0.4202 0.8928 0.9449
No log 56.6667 340 0.8872 0.4363 0.8872 0.9419
No log 57.0 342 0.8642 0.5089 0.8642 0.9296
No log 57.3333 344 0.8385 0.5381 0.8385 0.9157
No log 57.6667 346 0.8365 0.5508 0.8365 0.9146
No log 58.0 348 0.8388 0.5381 0.8388 0.9158
No log 58.3333 350 0.8453 0.5011 0.8453 0.9194
No log 58.6667 352 0.8624 0.4859 0.8624 0.9287
No log 59.0 354 0.8591 0.4767 0.8591 0.9269
No log 59.3333 356 0.8573 0.4778 0.8573 0.9259
No log 59.6667 358 0.8613 0.4996 0.8613 0.9280
No log 60.0 360 0.8713 0.4996 0.8713 0.9334
No log 60.3333 362 0.8793 0.4996 0.8793 0.9377
No log 60.6667 364 0.9012 0.4521 0.9012 0.9493
No log 61.0 366 0.9195 0.4016 0.9195 0.9589
No log 61.3333 368 0.9242 0.4016 0.9242 0.9613
No log 61.6667 370 0.9105 0.4013 0.9105 0.9542
No log 62.0 372 0.8902 0.4499 0.8902 0.9435
No log 62.3333 374 0.8857 0.4852 0.8857 0.9411
No log 62.6667 376 0.9032 0.4369 0.9032 0.9503
No log 63.0 378 0.9152 0.3841 0.9152 0.9566
No log 63.3333 380 0.9114 0.4107 0.9114 0.9547
No log 63.6667 382 0.8881 0.4965 0.8881 0.9424
No log 64.0 384 0.8776 0.4996 0.8776 0.9368
No log 64.3333 386 0.8706 0.5336 0.8706 0.9331
No log 64.6667 388 0.8696 0.5336 0.8696 0.9325
No log 65.0 390 0.8703 0.4996 0.8703 0.9329
No log 65.3333 392 0.8843 0.5089 0.8843 0.9404
No log 65.6667 394 0.9042 0.4965 0.9042 0.9509
No log 66.0 396 0.9083 0.4724 0.9083 0.9530
No log 66.3333 398 0.9168 0.4369 0.9168 0.9575
No log 66.6667 400 0.9161 0.4369 0.9161 0.9571
No log 67.0 402 0.8964 0.5089 0.8964 0.9468
No log 67.3333 404 0.8833 0.4996 0.8833 0.9398
No log 67.6667 406 0.8849 0.5137 0.8849 0.9407
No log 68.0 408 0.8878 0.4996 0.8878 0.9423
No log 68.3333 410 0.8958 0.4996 0.8958 0.9464
No log 68.6667 412 0.9079 0.5089 0.9079 0.9528
No log 69.0 414 0.9152 0.4401 0.9152 0.9567
No log 69.3333 416 0.9039 0.4996 0.9039 0.9507
No log 69.6667 418 0.8926 0.5011 0.8926 0.9448
No log 70.0 420 0.8905 0.5011 0.8905 0.9437
No log 70.3333 422 0.8916 0.5011 0.8916 0.9442
No log 70.6667 424 0.8970 0.5089 0.8970 0.9471
No log 71.0 426 0.9072 0.4746 0.9072 0.9525
No log 71.3333 428 0.9209 0.4055 0.9209 0.9596
No log 71.6667 430 0.9334 0.3764 0.9334 0.9661
No log 72.0 432 0.9465 0.3854 0.9465 0.9729
No log 72.3333 434 0.9302 0.4111 0.9302 0.9645
No log 72.6667 436 0.8996 0.4965 0.8996 0.9485
No log 73.0 438 0.8720 0.5107 0.8720 0.9338
No log 73.3333 440 0.8637 0.5027 0.8637 0.9293
No log 73.6667 442 0.8637 0.5027 0.8637 0.9293
No log 74.0 444 0.8692 0.4996 0.8692 0.9323
No log 74.3333 446 0.8797 0.5089 0.8797 0.9379
No log 74.6667 448 0.8891 0.5089 0.8891 0.9429
No log 75.0 450 0.8969 0.5287 0.8969 0.9471
No log 75.3333 452 0.9068 0.4816 0.9068 0.9522
No log 75.6667 454 0.9051 0.4816 0.9051 0.9513
No log 76.0 456 0.9056 0.4816 0.9056 0.9516
No log 76.3333 458 0.9030 0.4366 0.9030 0.9503
No log 76.6667 460 0.9013 0.4499 0.9013 0.9493
No log 77.0 462 0.9071 0.4366 0.9071 0.9524
No log 77.3333 464 0.9147 0.4363 0.9147 0.9564
No log 77.6667 466 0.9231 0.4572 0.9231 0.9608
No log 78.0 468 0.9230 0.4237 0.9230 0.9607
No log 78.3333 470 0.9159 0.4366 0.9159 0.9570
No log 78.6667 472 0.9088 0.4369 0.9088 0.9533
No log 79.0 474 0.9016 0.4852 0.9016 0.9495
No log 79.3333 476 0.9035 0.4757 0.9035 0.9505
No log 79.6667 478 0.9088 0.4852 0.9088 0.9533
No log 80.0 480 0.9057 0.4996 0.9057 0.9517
No log 80.3333 482 0.9025 0.4996 0.9025 0.9500
No log 80.6667 484 0.9061 0.4996 0.9061 0.9519
No log 81.0 486 0.9084 0.4401 0.9084 0.9531
No log 81.3333 488 0.9167 0.4273 0.9167 0.9575
No log 81.6667 490 0.9267 0.4108 0.9267 0.9627
No log 82.0 492 0.9231 0.3880 0.9231 0.9608
No log 82.3333 494 0.9155 0.4013 0.9155 0.9568
No log 82.6667 496 0.9138 0.4013 0.9138 0.9559
No log 83.0 498 0.9043 0.4401 0.9043 0.9510
0.2423 83.3333 500 0.8954 0.4996 0.8954 0.9462
0.2423 83.6667 502 0.8909 0.4996 0.8909 0.9439
0.2423 84.0 504 0.8935 0.4996 0.8935 0.9453
0.2423 84.3333 506 0.8957 0.5089 0.8957 0.9464
0.2423 84.6667 508 0.8961 0.5089 0.8961 0.9466
0.2423 85.0 510 0.8924 0.4996 0.8924 0.9446
0.2423 85.3333 512 0.8927 0.4757 0.8927 0.9448
0.2423 85.6667 514 0.8906 0.4757 0.8906 0.9437
0.2423 86.0 516 0.8883 0.4996 0.8883 0.9425
0.2423 86.3333 518 0.8870 0.5119 0.8870 0.9418
0.2423 86.6667 520 0.8863 0.5242 0.8863 0.9414
0.2423 87.0 522 0.8855 0.5242 0.8855 0.9410
0.2423 87.3333 524 0.8857 0.5242 0.8857 0.9411
0.2423 87.6667 526 0.8876 0.5242 0.8876 0.9421
0.2423 88.0 528 0.8936 0.4852 0.8936 0.9453
0.2423 88.3333 530 0.9037 0.4499 0.9037 0.9506
0.2423 88.6667 532 0.9224 0.4237 0.9224 0.9604
0.2423 89.0 534 0.9421 0.3771 0.9421 0.9706
0.2423 89.3333 536 0.9552 0.3771 0.9552 0.9774
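
The summary metrics at the top of the card match the last logged row (step 536), not necessarily the best one. The sketch below shows one way to parse rows of this log and pick the evaluation step with the lowest validation loss or the highest Qwk. Only a handful of rows are copied in, so the result holds for this excerpt; `parse_row` is a hypothetical helper, not part of any library.

```python
# A few rows copied verbatim from the training log above (not the full table).
LOG_EXCERPT = """\
No log 16.0 96 0.8465 0.5431 0.8465 0.9200
No log 57.3333 344 0.8385 0.5381 0.8385 0.9157
No log 57.6667 346 0.8365 0.5508 0.8365 0.9146
No log 58.0 348 0.8388 0.5381 0.8388 0.9158
0.2423 89.3333 536 0.9552 0.3771 0.9552 0.9774
"""


def parse_row(line):
    """Split one log row into a dict; 'No log' becomes a missing training loss."""
    fields = line.split()
    if fields[:2] == ["No", "log"]:
        train_loss, rest = None, fields[2:]
    else:
        train_loss, rest = float(fields[0]), fields[1:]
    epoch, step, val_loss, qwk, mse, rmse = rest
    return {
        "train_loss": train_loss,
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


rows = [parse_row(line) for line in LOG_EXCERPT.splitlines() if line.strip()]
best_by_loss = min(rows, key=lambda r: r["val_loss"])
best_by_qwk = max(rows, key=lambda r: r["qwk"])
print(best_by_loss["step"], best_by_qwk["step"])  # prints "346 346"
```

In this excerpt both criteria select step 346 (validation loss 0.8365, Qwk 0.5508), which is noticeably better than the final row reported in the summary.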

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
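
With library versions close to those listed above, the checkpoint could probably be loaded along the following lines. This is a sketch under assumptions: the card does not state the head architecture, so the use of `AutoModelForSequenceClassification` is a guess based on the reported metrics, and `score_texts` is a hypothetical helper name. The repository id is the one shown on the model's Hub page.

```python
# Repository id from the model's Hub page; everything else here is an assumption.
MODEL_ID = "MayBashendy/ArabicNewSplits7_FineTuningAraBERT_run3_AugV5_k1_task2_organization"


def score_texts(texts, model_id=MODEL_ID):
    """Return the raw outputs of the model head for each input text."""
    # Imported lazily so that defining this function does not require the libraries.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()  # disable dropout for inference

    inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    return logits.tolist()
```

Calling `score_texts(["..."])` downloads the weights on first use. How to interpret the returned values (a regression score or class logits) is not documented in this card.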