ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k1_task5_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02; the training dataset is not documented (the auto-generated card reports it as "None"). It achieves the following results on the evaluation set:

  • Loss: 0.7176
  • Qwk: 0.5786
  • Mse: 0.7176
  • Rmse: 0.8471

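Note that Loss equals Mse (0.7176), which suggests the model was trained as a regressor with an MSE objective, and Rmse is simply its square root (0.8471 ≈ √0.7176). Below is a minimal sketch of how these metrics can be recomputed from raw predictions; rounding continuous outputs to integer score bands before computing QWK is an assumption, since the card does not document the scoring scale:

```python
import numpy as np
from sklearn.metrics import cohen_kappa_score, mean_squared_error

def compute_metrics(preds: np.ndarray, labels: np.ndarray) -> dict:
    """Recompute the reported evaluation metrics from predictions.

    Rounding to the nearest integer score before QWK is an
    assumption; the card does not document the rubric's scale.
    """
    mse = mean_squared_error(labels, preds)
    qwk = cohen_kappa_score(
        np.rint(labels).astype(int),
        np.rint(preds).astype(int),
        weights="quadratic",  # quadratic weighted kappa (Qwk)
    )
    return {"qwk": qwk, "mse": mse, "rmse": float(np.sqrt(mse))}
```
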
Model description

More information needed

Intended uses & limitations

More information needed
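
Although the intended use is undocumented, the checkpoint loads through the standard Transformers sequence-classification API. A minimal inference sketch follows; treating the head as a single-value regressor is an assumption, consistent with Loss == Mse on the evaluation set:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

repo = "MayBashendy/ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k1_task5_organization"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSequenceClassification.from_pretrained(repo)
model.eval()

# Score one essay; a single regression head is an assumption.
text = "..."  # an Arabic essay
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.squeeze().item())
```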

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
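
As a convenience, the list above maps onto Transformers `TrainingArguments` roughly as follows. This is a reconstruction, not the author's script: `output_dir` is a placeholder, and the every-2-steps evaluation cadence is inferred from the results table below.

```python
from transformers import TrainingArguments

# A reconstruction of the hyperparameters listed above. Values not in
# the card (output_dir) are placeholders; eval_steps=2 is inferred from
# the evaluation table, which logs every 2 steps (5 steps per epoch).
training_args = TrainingArguments(
    output_dir="out",        # placeholder: not documented
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    adam_beta1=0.9,          # Adam settings as listed
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=100,
    eval_strategy="steps",   # evaluation every 2 optimizer steps
    eval_steps=2,
)
```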

Training results

The model was evaluated every 2 optimizer steps (5 steps per epoch, 500 steps total). The "Training Loss" column reads "No log" until the final row because the Trainer logs training loss only every 500 steps by default. Validation loss bottoms out at 0.6681 around epoch 47.6 before drifting back up to the final 0.7176.

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.4 2 3.8382 0.0024 3.8382 1.9591
No log 0.8 4 1.8005 -0.0046 1.8005 1.3418
No log 1.2 6 1.2756 0.0380 1.2756 1.1294
No log 1.6 8 0.9884 0.3288 0.9884 0.9942
No log 2.0 10 1.0934 0.1203 1.0934 1.0456
No log 2.4 12 1.1241 0.0445 1.1241 1.0602
No log 2.8 14 0.9857 0.1076 0.9857 0.9928
No log 3.2 16 0.9150 0.3094 0.9150 0.9565
No log 3.6 18 0.9283 0.2740 0.9283 0.9635
No log 4.0 20 0.9558 0.3365 0.9558 0.9776
No log 4.4 22 1.0006 0.3813 1.0006 1.0003
No log 4.8 24 0.9706 0.3243 0.9706 0.9852
No log 5.2 26 0.9353 0.2670 0.9353 0.9671
No log 5.6 28 0.9062 0.3666 0.9062 0.9519
No log 6.0 30 0.8772 0.3284 0.8772 0.9366
No log 6.4 32 0.8651 0.4181 0.8651 0.9301
No log 6.8 34 0.8172 0.4794 0.8172 0.9040
No log 7.2 36 0.7994 0.4856 0.7994 0.8941
No log 7.6 38 0.7923 0.5081 0.7923 0.8901
No log 8.0 40 0.7787 0.5416 0.7787 0.8824
No log 8.4 42 0.8595 0.4920 0.8595 0.9271
No log 8.8 44 0.7800 0.6002 0.7800 0.8832
No log 9.2 46 0.8010 0.5668 0.8010 0.8950
No log 9.6 48 0.7995 0.4840 0.7995 0.8941
No log 10.0 50 0.7166 0.5343 0.7166 0.8465
No log 10.4 52 0.7205 0.5463 0.7205 0.8488
No log 10.8 54 0.8152 0.4604 0.8152 0.9029
No log 11.2 56 0.9522 0.4581 0.9522 0.9758
No log 11.6 58 0.8353 0.4815 0.8353 0.9139
No log 12.0 60 0.7623 0.5888 0.7623 0.8731
No log 12.4 62 0.8340 0.5918 0.8340 0.9132
No log 12.8 64 0.7526 0.5773 0.7526 0.8675
No log 13.2 66 0.7967 0.4648 0.7967 0.8926
No log 13.6 68 0.9656 0.5087 0.9656 0.9827
No log 14.0 70 1.0849 0.4948 1.0849 1.0416
No log 14.4 72 0.9503 0.4152 0.9503 0.9748
No log 14.8 74 0.7793 0.4419 0.7793 0.8828
No log 15.2 76 0.7711 0.5057 0.7711 0.8781
No log 15.6 78 0.7976 0.4777 0.7976 0.8931
No log 16.0 80 0.7876 0.5107 0.7876 0.8875
No log 16.4 82 0.7822 0.5484 0.7822 0.8844
No log 16.8 84 0.9296 0.4490 0.9296 0.9642
No log 17.2 86 0.9086 0.4490 0.9086 0.9532
No log 17.6 88 0.7450 0.5419 0.7450 0.8631
No log 18.0 90 0.8070 0.4671 0.8070 0.8983
No log 18.4 92 0.8406 0.4582 0.8406 0.9168
No log 18.8 94 0.7251 0.5517 0.7251 0.8516
No log 19.2 96 0.7341 0.5279 0.7341 0.8568
No log 19.6 98 0.8239 0.4920 0.8239 0.9077
No log 20.0 100 0.7645 0.5736 0.7645 0.8744
No log 20.4 102 0.7653 0.5567 0.7653 0.8748
No log 20.8 104 0.8343 0.5542 0.8343 0.9134
No log 21.2 106 0.7524 0.4944 0.7524 0.8674
No log 21.6 108 0.7663 0.4954 0.7663 0.8754
No log 22.0 110 0.8204 0.4695 0.8204 0.9058
No log 22.4 112 0.8365 0.4799 0.8365 0.9146
No log 22.8 114 0.7721 0.5173 0.7721 0.8787
No log 23.2 116 0.7269 0.5797 0.7269 0.8526
No log 23.6 118 0.7297 0.5797 0.7297 0.8542
No log 24.0 120 0.7512 0.5317 0.7512 0.8667
No log 24.4 122 0.7528 0.5700 0.7528 0.8676
No log 24.8 124 0.7550 0.5855 0.7550 0.8689
No log 25.2 126 0.7831 0.4509 0.7831 0.8849
No log 25.6 128 0.7699 0.3785 0.7699 0.8774
No log 26.0 130 0.7569 0.5510 0.7569 0.8700
No log 26.4 132 0.7594 0.5305 0.7594 0.8714
No log 26.8 134 0.7434 0.5197 0.7434 0.8622
No log 27.2 136 0.7237 0.5666 0.7237 0.8507
No log 27.6 138 0.7316 0.5546 0.7316 0.8553
No log 28.0 140 0.7301 0.5866 0.7301 0.8544
No log 28.4 142 0.7600 0.5183 0.7600 0.8718
No log 28.8 144 0.7710 0.5076 0.7710 0.8780
No log 29.2 146 0.7524 0.4321 0.7524 0.8674
No log 29.6 148 0.7510 0.5204 0.7510 0.8666
No log 30.0 150 0.7521 0.5329 0.7521 0.8673
No log 30.4 152 0.7568 0.5877 0.7568 0.8699
No log 30.8 154 0.7693 0.5404 0.7693 0.8771
No log 31.2 156 0.7845 0.5291 0.7845 0.8857
No log 31.6 158 0.7549 0.5404 0.7549 0.8689
No log 32.0 160 0.7419 0.5898 0.7419 0.8613
No log 32.4 162 0.7771 0.4414 0.7771 0.8815
No log 32.8 164 0.7543 0.4733 0.7543 0.8685
No log 33.2 166 0.7598 0.5304 0.7598 0.8716
No log 33.6 168 0.7841 0.5175 0.7841 0.8855
No log 34.0 170 0.7639 0.5422 0.7639 0.8740
No log 34.4 172 0.7619 0.4252 0.7619 0.8729
No log 34.8 174 0.7744 0.4399 0.7744 0.8800
No log 35.2 176 0.7641 0.4505 0.7641 0.8741
No log 35.6 178 0.7462 0.6032 0.7462 0.8639
No log 36.0 180 0.7831 0.6066 0.7831 0.8849
No log 36.4 182 0.7892 0.5776 0.7892 0.8884
No log 36.8 184 0.7510 0.5902 0.7510 0.8666
No log 37.2 186 0.7400 0.5654 0.7400 0.8603
No log 37.6 188 0.7256 0.5874 0.7256 0.8518
No log 38.0 190 0.7302 0.5763 0.7302 0.8545
No log 38.4 192 0.7550 0.5676 0.7550 0.8689
No log 38.8 194 0.7711 0.5003 0.7711 0.8781
No log 39.2 196 0.7906 0.5217 0.7906 0.8892
No log 39.6 198 0.7753 0.4980 0.7753 0.8805
No log 40.0 200 0.7448 0.5246 0.7448 0.8630
No log 40.4 202 0.7323 0.5902 0.7323 0.8557
No log 40.8 204 0.7369 0.5610 0.7369 0.8584
No log 41.2 206 0.7238 0.5614 0.7238 0.8508
No log 41.6 208 0.7195 0.5861 0.7195 0.8483
No log 42.0 210 0.7384 0.4414 0.7384 0.8593
No log 42.4 212 0.7280 0.4856 0.7280 0.8532
No log 42.8 214 0.7058 0.5774 0.7058 0.8401
No log 43.2 216 0.7617 0.5279 0.7617 0.8728
No log 43.6 218 0.8058 0.5033 0.8058 0.8976
No log 44.0 220 0.7826 0.5033 0.7826 0.8846
No log 44.4 222 0.7240 0.5534 0.7240 0.8509
No log 44.8 224 0.7058 0.5261 0.7058 0.8401
No log 45.2 226 0.7015 0.5373 0.7015 0.8376
No log 45.6 228 0.7118 0.5552 0.7118 0.8437
No log 46.0 230 0.7752 0.5137 0.7752 0.8805
No log 46.4 232 0.7650 0.5137 0.7650 0.8746
No log 46.8 234 0.7226 0.5510 0.7226 0.8500
No log 47.2 236 0.6916 0.5719 0.6916 0.8316
No log 47.6 238 0.6681 0.5540 0.6681 0.8174
No log 48.0 240 0.6720 0.5540 0.6720 0.8198
No log 48.4 242 0.7065 0.5642 0.7065 0.8405
No log 48.8 244 0.7229 0.5504 0.7229 0.8503
No log 49.2 246 0.7358 0.5266 0.7358 0.8578
No log 49.6 248 0.7348 0.5266 0.7348 0.8572
No log 50.0 250 0.7703 0.5137 0.7703 0.8777
No log 50.4 252 0.7630 0.5254 0.7630 0.8735
No log 50.8 254 0.7213 0.5510 0.7213 0.8493
No log 51.2 256 0.7065 0.5751 0.7065 0.8405
No log 51.6 258 0.7100 0.6092 0.7100 0.8426
No log 52.0 260 0.7289 0.5318 0.7289 0.8538
No log 52.4 262 0.7476 0.5173 0.7476 0.8647
No log 52.8 264 0.7476 0.5046 0.7476 0.8647
No log 53.2 266 0.7277 0.5368 0.7277 0.8530
No log 53.6 268 0.6969 0.5719 0.6969 0.8348
No log 54.0 270 0.6898 0.6094 0.6898 0.8305
No log 54.4 272 0.6958 0.5575 0.6958 0.8341
No log 54.8 274 0.6943 0.5891 0.6943 0.8332
No log 55.2 276 0.7072 0.5317 0.7072 0.8410
No log 55.6 278 0.7164 0.5858 0.7164 0.8464
No log 56.0 280 0.7116 0.5880 0.7116 0.8435
No log 56.4 282 0.7140 0.5880 0.7140 0.8450
No log 56.8 284 0.7278 0.5986 0.7278 0.8531
No log 57.2 286 0.7376 0.5763 0.7376 0.8588
No log 57.6 288 0.7347 0.5763 0.7347 0.8571
No log 58.0 290 0.7179 0.6014 0.7179 0.8473
No log 58.4 292 0.7088 0.5917 0.7088 0.8419
No log 58.8 294 0.7042 0.5917 0.7042 0.8392
No log 59.2 296 0.7134 0.5996 0.7134 0.8446
No log 59.6 298 0.7462 0.5521 0.7462 0.8638
No log 60.0 300 0.7790 0.5491 0.7790 0.8826
No log 60.4 302 0.8139 0.4681 0.8139 0.9022
No log 60.8 304 0.8218 0.4681 0.8218 0.9066
No log 61.2 306 0.8097 0.5027 0.8097 0.8998
No log 61.6 308 0.7679 0.5509 0.7679 0.8763
No log 62.0 310 0.7170 0.5700 0.7170 0.8468
No log 62.4 312 0.7059 0.6065 0.7059 0.8402
No log 62.8 314 0.7081 0.5724 0.7081 0.8415
No log 63.2 316 0.7119 0.5701 0.7119 0.8437
No log 63.6 318 0.7203 0.6084 0.7203 0.8487
No log 64.0 320 0.7376 0.5944 0.7376 0.8588
No log 64.4 322 0.7441 0.5944 0.7441 0.8626
No log 64.8 324 0.7487 0.5944 0.7487 0.8653
No log 65.2 326 0.7378 0.6084 0.7378 0.8590
No log 65.6 328 0.7383 0.5557 0.7383 0.8592
No log 66.0 330 0.7705 0.4196 0.7705 0.8778
No log 66.4 332 0.7817 0.4196 0.7817 0.8841
No log 66.8 334 0.7485 0.4017 0.7485 0.8652
No log 67.2 336 0.7241 0.5902 0.7241 0.8509
No log 67.6 338 0.7402 0.5522 0.7402 0.8603
No log 68.0 340 0.7700 0.5385 0.7700 0.8775
No log 68.4 342 0.7731 0.5400 0.7731 0.8793
No log 68.8 344 0.7664 0.5527 0.7664 0.8754
No log 69.2 346 0.7621 0.5306 0.7621 0.8730
No log 69.6 348 0.7532 0.5696 0.7532 0.8679
No log 70.0 350 0.7496 0.5799 0.7496 0.8658
No log 70.4 352 0.7425 0.5799 0.7425 0.8617
No log 70.8 354 0.7318 0.5787 0.7318 0.8555
No log 71.2 356 0.7290 0.5975 0.7290 0.8538
No log 71.6 358 0.7340 0.5494 0.7340 0.8567
No log 72.0 360 0.7281 0.5699 0.7281 0.8533
No log 72.4 362 0.7158 0.6028 0.7158 0.8461
No log 72.8 364 0.7030 0.6256 0.7030 0.8385
No log 73.2 366 0.7017 0.6104 0.7017 0.8377
No log 73.6 368 0.7049 0.6383 0.7049 0.8396
No log 74.0 370 0.7048 0.6104 0.7048 0.8395
No log 74.4 372 0.7082 0.6256 0.7082 0.8416
No log 74.8 374 0.7187 0.6055 0.7187 0.8477
No log 75.2 376 0.7238 0.5730 0.7238 0.8508
No log 75.6 378 0.7263 0.5730 0.7263 0.8522
No log 76.0 380 0.7209 0.5730 0.7209 0.8491
No log 76.4 382 0.7107 0.5730 0.7107 0.8430
No log 76.8 384 0.7043 0.6380 0.7043 0.8392
No log 77.2 386 0.7010 0.6278 0.7010 0.8372
No log 77.6 388 0.7011 0.6055 0.7011 0.8373
No log 78.0 390 0.7000 0.5969 0.7000 0.8367
No log 78.4 392 0.7025 0.5996 0.7025 0.8382
No log 78.8 394 0.7123 0.6073 0.7123 0.8440
No log 79.2 396 0.7169 0.5666 0.7169 0.8467
No log 79.6 398 0.7214 0.5318 0.7214 0.8493
No log 80.0 400 0.7216 0.5413 0.7216 0.8495
No log 80.4 402 0.7207 0.5763 0.7207 0.8489
No log 80.8 404 0.7134 0.5676 0.7134 0.8447
No log 81.2 406 0.7057 0.5786 0.7057 0.8400
No log 81.6 408 0.7028 0.5996 0.7028 0.8383
No log 82.0 410 0.7009 0.5996 0.7009 0.8372
No log 82.4 412 0.7067 0.5786 0.7067 0.8406
No log 82.8 414 0.7096 0.5786 0.7096 0.8424
No log 83.2 416 0.7134 0.5786 0.7134 0.8447
No log 83.6 418 0.7180 0.5317 0.7180 0.8473
No log 84.0 420 0.7128 0.5786 0.7128 0.8443
No log 84.4 422 0.7043 0.5996 0.7043 0.8392
No log 84.8 424 0.7018 0.6025 0.7018 0.8377
No log 85.2 426 0.7044 0.5996 0.7044 0.8393
No log 85.6 428 0.7108 0.5969 0.7108 0.8431
No log 86.0 430 0.7182 0.5763 0.7182 0.8474
No log 86.4 432 0.7193 0.5763 0.7193 0.8481
No log 86.8 434 0.7154 0.5810 0.7154 0.8458
No log 87.2 436 0.7131 0.5822 0.7131 0.8444
No log 87.6 438 0.7110 0.6025 0.7110 0.8432
No log 88.0 440 0.7096 0.5996 0.7096 0.8424
No log 88.4 442 0.7098 0.5996 0.7098 0.8425
No log 88.8 444 0.7137 0.5996 0.7137 0.8448
No log 89.2 446 0.7223 0.5959 0.7223 0.8499
No log 89.6 448 0.7338 0.5380 0.7338 0.8566
No log 90.0 450 0.7410 0.5470 0.7410 0.8608
No log 90.4 452 0.7418 0.5470 0.7418 0.8613
No log 90.8 454 0.7364 0.5366 0.7364 0.8582
No log 91.2 456 0.7306 0.5844 0.7306 0.8547
No log 91.6 458 0.7241 0.5986 0.7241 0.8509
No log 92.0 460 0.7211 0.5786 0.7211 0.8492
No log 92.4 462 0.7208 0.5786 0.7208 0.8490
No log 92.8 464 0.7224 0.5786 0.7224 0.8499
No log 93.2 466 0.7225 0.5786 0.7225 0.8500
No log 93.6 468 0.7218 0.5786 0.7218 0.8496
No log 94.0 470 0.7207 0.5786 0.7207 0.8490
No log 94.4 472 0.7190 0.5786 0.7190 0.8479
No log 94.8 474 0.7192 0.5786 0.7192 0.8481
No log 95.2 476 0.7210 0.5786 0.7210 0.8491
No log 95.6 478 0.7210 0.5786 0.7210 0.8491
No log 96.0 480 0.7205 0.5786 0.7205 0.8488
No log 96.4 482 0.7209 0.5786 0.7209 0.8490
No log 96.8 484 0.7208 0.5786 0.7208 0.8490
No log 97.2 486 0.7204 0.5786 0.7204 0.8488
No log 97.6 488 0.7200 0.5786 0.7200 0.8485
No log 98.0 490 0.7196 0.5786 0.7196 0.8483
No log 98.4 492 0.7186 0.5786 0.7186 0.8477
No log 98.8 494 0.7181 0.5786 0.7181 0.8474
No log 99.2 496 0.7177 0.5786 0.7177 0.8472
No log 99.6 498 0.7176 0.5786 0.7176 0.8471
0.1586 100.0 500 0.7176 0.5786 0.7176 0.8471

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1