ArabicNewSplits7_OSS_usingWellWrittenEssays_FineTuningAraBERT_run3_AugV5_k1_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4741
  • Qwk: 0.6277
  • Mse: 0.4741
  • Rmse: 0.6885

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
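
The training script itself is not included with this card; the snippet below is a sketch of how the listed settings map onto transformers.TrainingArguments, with output_dir as a hypothetical placeholder.

```python
# Sketch of TrainingArguments mirroring the hyperparameters listed above.
# The actual training script is not part of this card.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="arabert_task7_organization",  # hypothetical placeholder
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=100,
)
```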

Training results

Training Loss | Epoch | Step | Validation Loss | Qwk | Mse | Rmse
No log 0.4 2 2.6379 -0.0729 2.6379 1.6242
No log 0.8 4 1.3491 0.0998 1.3491 1.1615
No log 1.2 6 0.7540 0.1786 0.7540 0.8684
No log 1.6 8 0.9388 0.1385 0.9388 0.9689
No log 2.0 10 0.8605 0.2939 0.8605 0.9277
No log 2.4 12 0.7241 0.1561 0.7241 0.8509
No log 2.8 14 0.7262 0.1187 0.7262 0.8522
No log 3.2 16 0.7067 0.2537 0.7067 0.8407
No log 3.6 18 0.6704 0.3950 0.6704 0.8188
No log 4.0 20 0.5962 0.3950 0.5962 0.7722
No log 4.4 22 0.5403 0.4186 0.5403 0.7350
No log 4.8 24 0.5884 0.4222 0.5884 0.7671
No log 5.2 26 0.6777 0.4361 0.6777 0.8232
No log 5.6 28 0.5881 0.5095 0.5881 0.7668
No log 6.0 30 0.5291 0.5368 0.5291 0.7274
No log 6.4 32 0.6249 0.5824 0.6249 0.7905
No log 6.8 34 0.5441 0.4771 0.5441 0.7377
No log 7.2 36 0.8700 0.4764 0.8700 0.9327
No log 7.6 38 1.0004 0.3160 1.0004 1.0002
No log 8.0 40 0.6739 0.5017 0.6739 0.8209
No log 8.4 42 0.5741 0.5633 0.5741 0.7577
No log 8.8 44 0.5900 0.5724 0.5900 0.7681
No log 9.2 46 0.6169 0.5627 0.6169 0.7854
No log 9.6 48 0.4796 0.6463 0.4796 0.6926
No log 10.0 50 0.6010 0.5343 0.6010 0.7753
No log 10.4 52 0.7247 0.4815 0.7247 0.8513
No log 10.8 54 0.5039 0.5696 0.5039 0.7099
No log 11.2 56 0.4521 0.5943 0.4521 0.6724
No log 11.6 58 0.4633 0.6143 0.4633 0.6807
No log 12.0 60 0.5675 0.5328 0.5675 0.7533
No log 12.4 62 0.5702 0.5266 0.5702 0.7551
No log 12.8 64 0.4454 0.5915 0.4454 0.6674
No log 13.2 66 0.5422 0.5170 0.5422 0.7363
No log 13.6 68 0.4945 0.6045 0.4945 0.7032
No log 14.0 70 0.4830 0.4486 0.4830 0.6950
No log 14.4 72 0.5136 0.4569 0.5136 0.7167
No log 14.8 74 0.4680 0.6229 0.4680 0.6841
No log 15.2 76 0.6099 0.4745 0.6099 0.7810
No log 15.6 78 0.7472 0.4930 0.7472 0.8644
No log 16.0 80 0.5719 0.4819 0.5719 0.7562
No log 16.4 82 0.4802 0.5217 0.4802 0.6930
No log 16.8 84 0.4827 0.5184 0.4827 0.6947
No log 17.2 86 0.4706 0.6228 0.4706 0.6860
No log 17.6 88 0.4650 0.6313 0.4650 0.6819
No log 18.0 90 0.4541 0.6452 0.4541 0.6739
No log 18.4 92 0.4573 0.6431 0.4573 0.6763
No log 18.8 94 0.4483 0.6566 0.4483 0.6695
No log 19.2 96 0.4693 0.5538 0.4693 0.6851
No log 19.6 98 0.4872 0.5584 0.4872 0.6980
No log 20.0 100 0.4745 0.5621 0.4745 0.6889
No log 20.4 102 0.4377 0.5390 0.4377 0.6616
No log 20.8 104 0.4430 0.6317 0.4430 0.6656
No log 21.2 106 0.5015 0.5720 0.5015 0.7081
No log 21.6 108 0.4828 0.5467 0.4828 0.6948
No log 22.0 110 0.4663 0.4515 0.4663 0.6828
No log 22.4 112 0.4788 0.5170 0.4788 0.6919
No log 22.8 114 0.5103 0.5836 0.5103 0.7143
No log 23.2 116 0.5217 0.6052 0.5217 0.7223
No log 23.6 118 0.4855 0.5786 0.4855 0.6968
No log 24.0 120 0.4551 0.6027 0.4551 0.6746
No log 24.4 122 0.4530 0.5524 0.4530 0.6730
No log 24.8 124 0.4388 0.6282 0.4388 0.6624
No log 25.2 126 0.4293 0.6344 0.4293 0.6552
No log 25.6 128 0.4314 0.6344 0.4314 0.6568
No log 26.0 130 0.4277 0.6377 0.4277 0.6540
No log 26.4 132 0.4307 0.6407 0.4307 0.6563
No log 26.8 134 0.4631 0.6092 0.4631 0.6805
No log 27.2 136 0.4585 0.6092 0.4585 0.6771
No log 27.6 138 0.4171 0.6383 0.4171 0.6459
No log 28.0 140 0.4121 0.6439 0.4121 0.6419
No log 28.4 142 0.4198 0.6435 0.4198 0.6479
No log 28.8 144 0.4455 0.5714 0.4455 0.6675
No log 29.2 146 0.5125 0.4652 0.5125 0.7159
No log 29.6 148 0.5437 0.5474 0.5437 0.7374
No log 30.0 150 0.4766 0.5861 0.4766 0.6903
No log 30.4 152 0.4308 0.6743 0.4308 0.6564
No log 30.8 154 0.4527 0.6778 0.4527 0.6728
No log 31.2 156 0.4563 0.6778 0.4563 0.6755
No log 31.6 158 0.4565 0.6301 0.4565 0.6756
No log 32.0 160 0.5201 0.5616 0.5201 0.7212
No log 32.4 162 0.5009 0.5014 0.5009 0.7078
No log 32.8 164 0.4554 0.5339 0.4554 0.6749
No log 33.2 166 0.4693 0.5708 0.4693 0.6851
No log 33.6 168 0.5415 0.5787 0.5415 0.7359
No log 34.0 170 0.5624 0.5595 0.5624 0.7500
No log 34.4 172 0.5258 0.5290 0.5258 0.7251
No log 34.8 174 0.4943 0.6256 0.4943 0.7030
No log 35.2 176 0.5151 0.5692 0.5151 0.7177
No log 35.6 178 0.4874 0.5751 0.4874 0.6981
No log 36.0 180 0.4753 0.6118 0.4753 0.6894
No log 36.4 182 0.4691 0.6364 0.4691 0.6849
No log 36.8 184 0.4638 0.5890 0.4638 0.6810
No log 37.2 186 0.5082 0.5249 0.5082 0.7129
No log 37.6 188 0.5487 0.5061 0.5487 0.7407
No log 38.0 190 0.4943 0.5575 0.4943 0.7031
No log 38.4 192 0.4591 0.6650 0.4591 0.6776
No log 38.8 194 0.4966 0.5499 0.4966 0.7047
No log 39.2 196 0.5265 0.5484 0.5265 0.7256
No log 39.6 198 0.5055 0.5708 0.5055 0.7110
No log 40.0 200 0.4792 0.5396 0.4792 0.6922
No log 40.4 202 0.4878 0.4423 0.4878 0.6984
No log 40.8 204 0.4919 0.4678 0.4919 0.7013
No log 41.2 206 0.4801 0.4953 0.4801 0.6929
No log 41.6 208 0.4780 0.6018 0.4780 0.6914
No log 42.0 210 0.4789 0.5923 0.4789 0.6920
No log 42.4 212 0.4711 0.6431 0.4711 0.6863
No log 42.8 214 0.4664 0.6339 0.4664 0.6829
No log 43.2 216 0.4645 0.6184 0.4645 0.6815
No log 43.6 218 0.4633 0.6184 0.4633 0.6807
No log 44.0 220 0.4630 0.5943 0.4630 0.6805
No log 44.4 222 0.4664 0.5587 0.4664 0.6829
No log 44.8 224 0.4704 0.5631 0.4704 0.6858
No log 45.2 226 0.4766 0.5708 0.4766 0.6904
No log 45.6 228 0.4707 0.5732 0.4707 0.6861
No log 46.0 230 0.4694 0.4847 0.4694 0.6851
No log 46.4 232 0.4748 0.4538 0.4748 0.6891
No log 46.8 234 0.4658 0.4795 0.4658 0.6825
No log 47.2 236 0.4587 0.5413 0.4587 0.6773
No log 47.6 238 0.4552 0.5340 0.4552 0.6747
No log 48.0 240 0.4532 0.5846 0.4532 0.6732
No log 48.4 242 0.4562 0.5915 0.4562 0.6755
No log 48.8 244 0.4784 0.5587 0.4784 0.6917
No log 49.2 246 0.5019 0.5438 0.5019 0.7084
No log 49.6 248 0.4794 0.5117 0.4794 0.6924
No log 50.0 250 0.4506 0.5641 0.4506 0.6713
No log 50.4 252 0.4685 0.5732 0.4685 0.6845
No log 50.8 254 0.5142 0.5081 0.5142 0.7171
No log 51.2 256 0.5325 0.5140 0.5325 0.7297
No log 51.6 258 0.4881 0.5457 0.4881 0.6986
No log 52.0 260 0.4482 0.6240 0.4482 0.6695
No log 52.4 262 0.4464 0.6267 0.4464 0.6681
No log 52.8 264 0.4479 0.6388 0.4479 0.6692
No log 53.2 266 0.4521 0.6353 0.4521 0.6724
No log 53.6 268 0.4561 0.6145 0.4561 0.6753
No log 54.0 270 0.4668 0.5995 0.4668 0.6832
No log 54.4 272 0.4714 0.5995 0.4714 0.6866
No log 54.8 274 0.4683 0.6018 0.4683 0.6843
No log 55.2 276 0.4762 0.5801 0.4762 0.6901
No log 55.6 278 0.4745 0.6240 0.4745 0.6888
No log 56.0 280 0.4702 0.6388 0.4702 0.6857
No log 56.4 282 0.4720 0.6059 0.4720 0.6870
No log 56.8 284 0.4738 0.6059 0.4738 0.6883
No log 57.2 286 0.4635 0.5939 0.4635 0.6808
No log 57.6 288 0.4611 0.5731 0.4611 0.6790
No log 58.0 290 0.4660 0.5718 0.4660 0.6827
No log 58.4 292 0.4612 0.5352 0.4612 0.6791
No log 58.8 294 0.4640 0.5399 0.4640 0.6812
No log 59.2 296 0.4764 0.5498 0.4764 0.6902
No log 59.6 298 0.4990 0.5445 0.4990 0.7064
No log 60.0 300 0.4903 0.5445 0.4903 0.7002
No log 60.4 302 0.4670 0.5692 0.4670 0.6834
No log 60.8 304 0.4477 0.6673 0.4477 0.6691
No log 61.2 306 0.4420 0.6530 0.4420 0.6648
No log 61.6 308 0.4527 0.5995 0.4527 0.6728
No log 62.0 310 0.4552 0.5995 0.4552 0.6747
No log 62.4 312 0.4559 0.6214 0.4559 0.6752
No log 62.8 314 0.4493 0.6530 0.4493 0.6703
No log 63.2 316 0.4535 0.5798 0.4535 0.6734
No log 63.6 318 0.4889 0.5095 0.4889 0.6992
No log 64.0 320 0.5204 0.5031 0.5204 0.7214
No log 64.4 322 0.5183 0.5031 0.5183 0.7199
No log 64.8 324 0.4941 0.5095 0.4941 0.7029
No log 65.2 326 0.4663 0.5422 0.4663 0.6829
No log 65.6 328 0.4520 0.6661 0.4520 0.6723
No log 66.0 330 0.4457 0.6639 0.4457 0.6676
No log 66.4 332 0.4471 0.7114 0.4471 0.6687
No log 66.8 334 0.4496 0.6477 0.4496 0.6705
No log 67.2 336 0.4593 0.6383 0.4593 0.6777
No log 67.6 338 0.4704 0.5877 0.4704 0.6859
No log 68.0 340 0.4749 0.5666 0.4749 0.6891
No log 68.4 342 0.4696 0.5189 0.4696 0.6853
No log 68.8 344 0.4587 0.5493 0.4587 0.6773
No log 69.2 346 0.4519 0.6351 0.4519 0.6722
No log 69.6 348 0.4565 0.6627 0.4565 0.6756
No log 70.0 350 0.4605 0.6530 0.4605 0.6786
No log 70.4 352 0.4601 0.6132 0.4601 0.6783
No log 70.8 354 0.4557 0.5868 0.4557 0.6751
No log 71.2 356 0.4544 0.5665 0.4544 0.6741
No log 71.6 358 0.4562 0.5899 0.4562 0.6754
No log 72.0 360 0.4583 0.6125 0.4583 0.6769
No log 72.4 362 0.4604 0.6125 0.4604 0.6785
No log 72.8 364 0.4634 0.6125 0.4634 0.6807
No log 73.2 366 0.4662 0.6125 0.4662 0.6828
No log 73.6 368 0.4746 0.6132 0.4746 0.6889
No log 74.0 370 0.4808 0.5741 0.4808 0.6934
No log 74.4 372 0.4783 0.5708 0.4783 0.6916
No log 74.8 374 0.4737 0.6132 0.4737 0.6883
No log 75.2 376 0.4695 0.5665 0.4695 0.6852
No log 75.6 378 0.4674 0.5600 0.4674 0.6836
No log 76.0 380 0.4676 0.5750 0.4676 0.6838
No log 76.4 382 0.4689 0.5970 0.4689 0.6847
No log 76.8 384 0.4735 0.5736 0.4735 0.6881
No log 77.2 386 0.4772 0.5943 0.4772 0.6908
No log 77.6 388 0.4769 0.5943 0.4769 0.6906
No log 78.0 390 0.4729 0.6407 0.4729 0.6877
No log 78.4 392 0.4754 0.6416 0.4754 0.6895
No log 78.8 394 0.4802 0.6267 0.4802 0.6930
No log 79.2 396 0.4873 0.6195 0.4873 0.6981
No log 79.6 398 0.4945 0.5941 0.4945 0.7032
No log 80.0 400 0.4916 0.5919 0.4916 0.7012
No log 80.4 402 0.4927 0.5919 0.4927 0.7020
No log 80.8 404 0.4853 0.5692 0.4853 0.6966
No log 81.2 406 0.4796 0.5692 0.4796 0.6926
No log 81.6 408 0.4724 0.5663 0.4724 0.6873
No log 82.0 410 0.4653 0.5634 0.4653 0.6821
No log 82.4 412 0.4618 0.6467 0.4618 0.6796
No log 82.8 414 0.4639 0.6121 0.4639 0.6811
No log 83.2 416 0.4662 0.6121 0.4662 0.6828
No log 83.6 418 0.4695 0.6121 0.4695 0.6852
No log 84.0 420 0.4712 0.6121 0.4712 0.6864
No log 84.4 422 0.4733 0.6121 0.4733 0.6879
No log 84.8 424 0.4751 0.5669 0.4751 0.6893
No log 85.2 426 0.4777 0.5600 0.4777 0.6911
No log 85.6 428 0.4804 0.5600 0.4804 0.6931
No log 86.0 430 0.4787 0.6136 0.4787 0.6919
No log 86.4 432 0.4767 0.6136 0.4767 0.6905
No log 86.8 434 0.4731 0.6286 0.4731 0.6878
No log 87.2 436 0.4704 0.6286 0.4704 0.6859
No log 87.6 438 0.4672 0.6277 0.4672 0.6835
No log 88.0 440 0.4663 0.6277 0.4663 0.6828
No log 88.4 442 0.4674 0.6277 0.4674 0.6836
No log 88.8 444 0.4689 0.5812 0.4689 0.6848
No log 89.2 446 0.4711 0.5812 0.4711 0.6864
No log 89.6 448 0.4744 0.5701 0.4744 0.6887
No log 90.0 450 0.4786 0.5701 0.4786 0.6918
No log 90.4 452 0.4812 0.5634 0.4812 0.6937
No log 90.8 454 0.4836 0.6325 0.4836 0.6954
No log 91.2 456 0.4837 0.6398 0.4837 0.6955
No log 91.6 458 0.4824 0.6398 0.4824 0.6945
No log 92.0 460 0.4809 0.6551 0.4809 0.6935
No log 92.4 462 0.4794 0.6551 0.4794 0.6924
No log 92.8 464 0.4780 0.6286 0.4780 0.6913
No log 93.2 466 0.4768 0.6286 0.4768 0.6905
No log 93.6 468 0.4753 0.6286 0.4753 0.6894
No log 94.0 470 0.4746 0.6472 0.4746 0.6889
No log 94.4 472 0.4750 0.5669 0.4750 0.6892
No log 94.8 474 0.4756 0.5669 0.4756 0.6896
No log 95.2 476 0.4759 0.5669 0.4759 0.6898
No log 95.6 478 0.4750 0.5669 0.4750 0.6892
No log 96.0 480 0.4738 0.6469 0.4738 0.6883
No log 96.4 482 0.4730 0.6277 0.4730 0.6878
No log 96.8 484 0.4730 0.6277 0.4730 0.6878
No log 97.2 486 0.4734 0.6277 0.4734 0.6881
No log 97.6 488 0.4738 0.6277 0.4738 0.6883
No log 98.0 490 0.4739 0.6277 0.4739 0.6884
No log 98.4 492 0.4740 0.6277 0.4740 0.6885
No log 98.8 494 0.4741 0.6277 0.4741 0.6886
No log 99.2 496 0.4740 0.6277 0.4740 0.6885
No log 99.6 498 0.4740 0.6277 0.4740 0.6885
0.1874 100.0 500 0.4741 0.6277 0.4741 0.6885
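
In the table above, Qwk is quadratic weighted kappa, and the Validation Loss and Mse columns coincide because the validation loss is mean squared error; Rmse is its square root. The sketch below shows how these metrics are conventionally computed, using hypothetical integer scores (quadratic weighted kappa is defined over discrete labels, so continuous predictions would first be rounded or clipped):

```python
# Conventional computation of the reported metrics, on hypothetical data.
import numpy as np
from sklearn.metrics import cohen_kappa_score, mean_squared_error

y_true = np.array([3, 2, 4, 1, 3])  # hypothetical gold organization scores
y_pred = np.array([3, 2, 3, 2, 3])  # hypothetical (rounded) predictions

qwk = cohen_kappa_score(y_true, y_pred, weights="quadratic")
mse = mean_squared_error(y_true, y_pred)
rmse = np.sqrt(mse)  # Rmse = sqrt(Mse), as in the table
print(f"Qwk={qwk:.4f}  Mse={mse:.4f}  Rmse={rmse:.4f}")
```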

Framework versions

  • Transformers 4.44.2
  • PyTorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1