ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run3_AugV5_k1_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4843
  • Qwk: 0.5567
  • Mse: 0.4843
  • Rmse: 0.6959

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.3333 2 2.7077 -0.0262 2.7077 1.6455
No log 0.6667 4 1.5778 0.0789 1.5778 1.2561
No log 1.0 6 0.8680 0.1724 0.8680 0.9317
No log 1.3333 8 0.8770 -0.0192 0.8770 0.9365
No log 1.6667 10 1.2562 0.0450 1.2562 1.1208
No log 2.0 12 1.8208 -0.0441 1.8208 1.3494
No log 2.3333 14 1.7375 0.0413 1.7375 1.3182
No log 2.6667 16 0.7436 0.2467 0.7436 0.8623
No log 3.0 18 0.7193 0.3090 0.7193 0.8481
No log 3.3333 20 0.5854 0.2361 0.5854 0.7651
No log 3.6667 22 0.6130 0.4795 0.6130 0.7830
No log 4.0 24 0.9199 0.3163 0.9199 0.9591
No log 4.3333 26 0.8155 0.3747 0.8155 0.9030
No log 4.6667 28 0.5258 0.4966 0.5258 0.7251
No log 5.0 30 0.5507 0.5855 0.5507 0.7421
No log 5.3333 32 0.6733 0.4574 0.6733 0.8205
No log 5.6667 34 0.4946 0.6034 0.4946 0.7033
No log 6.0 36 0.4364 0.6142 0.4364 0.6606
No log 6.3333 38 0.4562 0.5647 0.4562 0.6754
No log 6.6667 40 0.4340 0.5874 0.4340 0.6588
No log 7.0 42 0.4223 0.6467 0.4223 0.6499
No log 7.3333 44 0.4221 0.6661 0.4221 0.6497
No log 7.6667 46 0.4589 0.6052 0.4589 0.6774
No log 8.0 48 0.4552 0.6240 0.4552 0.6747
No log 8.3333 50 0.4137 0.6371 0.4137 0.6432
No log 8.6667 52 0.4692 0.6431 0.4692 0.6850
No log 9.0 54 0.4468 0.6279 0.4468 0.6685
No log 9.3333 56 0.4501 0.6254 0.4501 0.6709
No log 9.6667 58 0.4403 0.6073 0.4403 0.6636
No log 10.0 60 0.4722 0.6305 0.4722 0.6872
No log 10.3333 62 0.6593 0.5489 0.6593 0.8120
No log 10.6667 64 0.6474 0.5489 0.6474 0.8046
No log 11.0 66 0.5312 0.6390 0.5312 0.7289
No log 11.3333 68 0.4468 0.5875 0.4468 0.6684
No log 11.6667 70 0.4934 0.5580 0.4934 0.7024
No log 12.0 72 0.4488 0.5797 0.4488 0.6699
No log 12.3333 74 0.6137 0.5982 0.6137 0.7834
No log 12.6667 76 0.7146 0.4789 0.7146 0.8453
No log 13.0 78 0.5230 0.6648 0.5230 0.7232
No log 13.3333 80 0.4259 0.6173 0.4259 0.6526
No log 13.6667 82 0.5126 0.6039 0.5126 0.7160
No log 14.0 84 0.4583 0.6135 0.4583 0.6770
No log 14.3333 86 0.4032 0.6383 0.4032 0.6350
No log 14.6667 88 0.4511 0.6612 0.4511 0.6717
No log 15.0 90 0.4414 0.6414 0.4414 0.6644
No log 15.3333 92 0.3988 0.6383 0.3988 0.6315
No log 15.6667 94 0.4826 0.5998 0.4826 0.6947
No log 16.0 96 0.5164 0.6005 0.5164 0.7186
No log 16.3333 98 0.4738 0.5484 0.4738 0.6884
No log 16.6667 100 0.4019 0.6632 0.4019 0.6340
No log 17.0 102 0.4020 0.6935 0.4020 0.6340
No log 17.3333 104 0.4093 0.6655 0.4093 0.6397
No log 17.6667 106 0.4044 0.6426 0.4044 0.6359
No log 18.0 108 0.4241 0.6018 0.4241 0.6512
No log 18.3333 110 0.4218 0.6541 0.4218 0.6494
No log 18.6667 112 0.4051 0.6566 0.4051 0.6365
No log 19.0 114 0.4537 0.6154 0.4537 0.6736
No log 19.3333 116 0.5545 0.5502 0.5545 0.7446
No log 19.6667 118 0.5974 0.5228 0.5974 0.7729
No log 20.0 120 0.4793 0.5975 0.4793 0.6923
No log 20.3333 122 0.4212 0.6452 0.4212 0.6490
No log 20.6667 124 0.4705 0.5501 0.4705 0.6859
No log 21.0 126 0.4867 0.5254 0.4867 0.6976
No log 21.3333 128 0.4374 0.5800 0.4374 0.6613
No log 21.6667 130 0.4878 0.5317 0.4878 0.6984
No log 22.0 132 0.5044 0.5763 0.5044 0.7102
No log 22.3333 134 0.4368 0.6170 0.4368 0.6609
No log 22.6667 136 0.4517 0.6228 0.4517 0.6721
No log 23.0 138 0.5429 0.5870 0.5429 0.7368
No log 23.3333 140 0.5410 0.5893 0.5410 0.7355
No log 23.6667 142 0.4338 0.6142 0.4338 0.6586
No log 24.0 144 0.4906 0.6514 0.4906 0.7004
No log 24.3333 146 0.6033 0.6023 0.6033 0.7767
No log 24.6667 148 0.5479 0.5897 0.5479 0.7402
No log 25.0 150 0.4672 0.5548 0.4672 0.6835
No log 25.3333 152 0.5002 0.5612 0.5002 0.7073
No log 25.6667 154 0.5336 0.5597 0.5336 0.7305
No log 26.0 156 0.5167 0.5632 0.5167 0.7188
No log 26.3333 158 0.4822 0.4934 0.4822 0.6944
No log 26.6667 160 0.4821 0.5538 0.4821 0.6944
No log 27.0 162 0.4945 0.5731 0.4945 0.7032
No log 27.3333 164 0.4906 0.5731 0.4906 0.7005
No log 27.6667 166 0.4870 0.5765 0.4870 0.6979
No log 28.0 168 0.4674 0.5765 0.4674 0.6836
No log 28.3333 170 0.4557 0.6020 0.4557 0.6751
No log 28.6667 172 0.4503 0.5767 0.4503 0.6711
No log 29.0 174 0.4410 0.5753 0.4410 0.6641
No log 29.3333 176 0.4328 0.6233 0.4328 0.6579
No log 29.6667 178 0.4172 0.6052 0.4172 0.6459
No log 30.0 180 0.4031 0.6014 0.4031 0.6349
No log 30.3333 182 0.3975 0.6118 0.3975 0.6305
No log 30.6667 184 0.3950 0.6267 0.3950 0.6285
No log 31.0 186 0.4296 0.6797 0.4296 0.6554
No log 31.3333 188 0.4791 0.6485 0.4791 0.6922
No log 31.6667 190 0.5026 0.6566 0.5026 0.7089
No log 32.0 192 0.4625 0.6419 0.4625 0.6800
No log 32.3333 194 0.4238 0.6452 0.4238 0.6510
No log 32.6667 196 0.4083 0.5926 0.4083 0.6390
No log 33.0 198 0.4074 0.6688 0.4074 0.6383
No log 33.3333 200 0.4036 0.6395 0.4036 0.6353
No log 33.6667 202 0.4003 0.5993 0.4003 0.6327
No log 34.0 204 0.3982 0.6267 0.3982 0.6311
No log 34.3333 206 0.4035 0.6443 0.4035 0.6352
No log 34.6667 208 0.4111 0.6797 0.4111 0.6412
No log 35.0 210 0.4121 0.6608 0.4121 0.6419
No log 35.3333 212 0.4036 0.6627 0.4036 0.6353
No log 35.6667 214 0.4181 0.6709 0.4181 0.6466
No log 36.0 216 0.4132 0.6623 0.4132 0.6428
No log 36.3333 218 0.3974 0.6118 0.3974 0.6304
No log 36.6667 220 0.4415 0.6972 0.4415 0.6644
No log 37.0 222 0.4400 0.6972 0.4400 0.6634
No log 37.3333 224 0.4150 0.6698 0.4150 0.6442
No log 37.6667 226 0.4056 0.6837 0.4056 0.6369
No log 38.0 228 0.4174 0.6807 0.4174 0.6461
No log 38.3333 230 0.4157 0.6716 0.4157 0.6448
No log 38.6667 232 0.4284 0.6418 0.4284 0.6545
No log 39.0 234 0.4213 0.6024 0.4213 0.6490
No log 39.3333 236 0.4231 0.6125 0.4231 0.6505
No log 39.6667 238 0.4228 0.6111 0.4228 0.6503
No log 40.0 240 0.4240 0.6242 0.4240 0.6511
No log 40.3333 242 0.4221 0.6242 0.4221 0.6497
No log 40.6667 244 0.4208 0.6292 0.4208 0.6487
No log 41.0 246 0.4264 0.6020 0.4264 0.6530
No log 41.3333 248 0.4214 0.6020 0.4214 0.6491
No log 41.6667 250 0.4263 0.6223 0.4263 0.6529
No log 42.0 252 0.4202 0.6223 0.4202 0.6482
No log 42.3333 254 0.4081 0.6566 0.4081 0.6388
No log 42.6667 256 0.4152 0.6818 0.4152 0.6444
No log 43.0 258 0.4146 0.6818 0.4146 0.6439
No log 43.3333 260 0.4108 0.6639 0.4108 0.6409
No log 43.6667 262 0.4084 0.6587 0.4084 0.6391
No log 44.0 264 0.4353 0.6419 0.4353 0.6598
No log 44.3333 266 0.4473 0.6337 0.4473 0.6688
No log 44.6667 268 0.4222 0.6598 0.4222 0.6498
No log 45.0 270 0.4081 0.6672 0.4081 0.6388
No log 45.3333 272 0.4151 0.6739 0.4151 0.6443
No log 45.6667 274 0.4315 0.6818 0.4315 0.6569
No log 46.0 276 0.4593 0.5653 0.4593 0.6777
No log 46.3333 278 0.4701 0.5712 0.4701 0.6856
No log 46.6667 280 0.4502 0.6018 0.4502 0.6710
No log 47.0 282 0.4301 0.5941 0.4301 0.6558
No log 47.3333 284 0.4256 0.6183 0.4256 0.6524
No log 47.6667 286 0.4807 0.6141 0.4807 0.6934
No log 48.0 288 0.5258 0.5051 0.5258 0.7251
No log 48.3333 290 0.5315 0.4814 0.5315 0.7290
No log 48.6667 292 0.4796 0.6013 0.4796 0.6926
No log 49.0 294 0.4485 0.5522 0.4485 0.6697
No log 49.3333 296 0.4602 0.5326 0.4602 0.6784
No log 49.6667 298 0.4973 0.5485 0.4973 0.7052
No log 50.0 300 0.5413 0.5131 0.5413 0.7357
No log 50.3333 302 0.6260 0.5241 0.6260 0.7912
No log 50.6667 304 0.6877 0.4764 0.6877 0.8293
No log 51.0 306 0.6361 0.4404 0.6361 0.7976
No log 51.3333 308 0.5585 0.4451 0.5585 0.7474
No log 51.6667 310 0.5036 0.5326 0.5036 0.7097
No log 52.0 312 0.4729 0.5665 0.4729 0.6877
No log 52.3333 314 0.4576 0.5600 0.4576 0.6765
No log 52.6667 316 0.4545 0.5800 0.4545 0.6742
No log 53.0 318 0.4742 0.5597 0.4742 0.6886
No log 53.3333 320 0.4875 0.5552 0.4875 0.6982
No log 53.6667 322 0.4809 0.5831 0.4809 0.6935
No log 54.0 324 0.4882 0.5845 0.4882 0.6987
No log 54.3333 326 0.4895 0.5673 0.4895 0.6997
No log 54.6667 328 0.4697 0.5758 0.4697 0.6853
No log 55.0 330 0.4388 0.7012 0.4388 0.6624
No log 55.3333 332 0.4395 0.6993 0.4395 0.6630
No log 55.6667 334 0.4308 0.6730 0.4308 0.6564
No log 56.0 336 0.4379 0.6993 0.4379 0.6617
No log 56.3333 338 0.4425 0.6894 0.4425 0.6652
No log 56.6667 340 0.4370 0.6894 0.4370 0.6611
No log 57.0 342 0.4112 0.6730 0.4112 0.6412
No log 57.3333 344 0.3973 0.6469 0.3973 0.6303
No log 57.6667 346 0.4023 0.6491 0.4023 0.6343
No log 58.0 348 0.4164 0.6598 0.4164 0.6453
No log 58.3333 350 0.4092 0.6683 0.4092 0.6397
No log 58.6667 352 0.3975 0.6492 0.3975 0.6305
No log 59.0 354 0.3949 0.6481 0.3949 0.6284
No log 59.3333 356 0.3979 0.6730 0.3979 0.6308
No log 59.6667 358 0.4109 0.6818 0.4109 0.6410
No log 60.0 360 0.4091 0.6364 0.4091 0.6396
No log 60.3333 362 0.4079 0.6279 0.4079 0.6386
No log 60.6667 364 0.4176 0.6279 0.4176 0.6462
No log 61.0 366 0.4228 0.6279 0.4228 0.6503
No log 61.3333 368 0.4258 0.6279 0.4258 0.6525
No log 61.6667 370 0.4301 0.6279 0.4301 0.6558
No log 62.0 372 0.4383 0.6171 0.4383 0.6620
No log 62.3333 374 0.4483 0.5936 0.4483 0.6696
No log 62.6667 376 0.4502 0.6185 0.4502 0.6710
No log 63.0 378 0.4593 0.5974 0.4593 0.6778
No log 63.3333 380 0.4792 0.6025 0.4792 0.6922
No log 63.6667 382 0.4881 0.5395 0.4881 0.6986
No log 64.0 384 0.4747 0.5577 0.4747 0.6890
No log 64.3333 386 0.4615 0.5362 0.4615 0.6793
No log 64.6667 388 0.4589 0.5550 0.4589 0.6774
No log 65.0 390 0.4631 0.5307 0.4631 0.6805
No log 65.3333 392 0.4756 0.5345 0.4756 0.6897
No log 65.6667 394 0.4797 0.5324 0.4797 0.6926
No log 66.0 396 0.4817 0.5324 0.4817 0.6941
No log 66.3333 398 0.4797 0.5543 0.4797 0.6926
No log 66.6667 400 0.4687 0.5784 0.4687 0.6846
No log 67.0 402 0.4573 0.5899 0.4573 0.6762
No log 67.3333 404 0.4524 0.5899 0.4524 0.6726
No log 67.6667 406 0.4535 0.5899 0.4535 0.6734
No log 68.0 408 0.4518 0.5899 0.4518 0.6721
No log 68.3333 410 0.4514 0.6125 0.4514 0.6718
No log 68.6667 412 0.4563 0.5899 0.4563 0.6755
No log 69.0 414 0.4647 0.5899 0.4647 0.6817
No log 69.3333 416 0.4759 0.5326 0.4759 0.6898
No log 69.6667 418 0.4937 0.5056 0.4937 0.7027
No log 70.0 420 0.5048 0.5289 0.5048 0.7105
No log 70.3333 422 0.5203 0.4875 0.5203 0.7213
No log 70.6667 424 0.5202 0.4875 0.5202 0.7212
No log 71.0 426 0.4976 0.5056 0.4976 0.7054
No log 71.3333 428 0.4870 0.5076 0.4870 0.6979
No log 71.6667 430 0.4695 0.5899 0.4695 0.6852
No log 72.0 432 0.4594 0.5815 0.4594 0.6778
No log 72.3333 434 0.4633 0.6101 0.4633 0.6807
No log 72.6667 436 0.4765 0.6013 0.4765 0.6903
No log 73.0 438 0.4881 0.5528 0.4881 0.6987
No log 73.3333 440 0.5001 0.5528 0.5001 0.7072
No log 73.6667 442 0.4843 0.5438 0.4843 0.6959
No log 74.0 444 0.4621 0.6114 0.4621 0.6798
No log 74.3333 446 0.4542 0.6020 0.4542 0.6740
No log 74.6667 448 0.4473 0.6292 0.4473 0.6688
No log 75.0 450 0.4481 0.5956 0.4481 0.6694
No log 75.3333 452 0.4559 0.5899 0.4559 0.6752
No log 75.6667 454 0.4615 0.6214 0.4615 0.6793
No log 76.0 456 0.4659 0.6214 0.4659 0.6826
No log 76.3333 458 0.4675 0.5995 0.4675 0.6838
No log 76.6667 460 0.4730 0.6201 0.4730 0.6877
No log 77.0 462 0.4806 0.6201 0.4806 0.6932
No log 77.3333 464 0.5030 0.5736 0.5030 0.7092
No log 77.6667 466 0.5228 0.5327 0.5228 0.7230
No log 78.0 468 0.5306 0.5544 0.5306 0.7284
No log 78.3333 470 0.5266 0.5327 0.5266 0.7257
No log 78.6667 472 0.5202 0.5327 0.5202 0.7212
No log 79.0 474 0.5239 0.5327 0.5239 0.7238
No log 79.3333 476 0.5140 0.5736 0.5140 0.7169
No log 79.6667 478 0.5015 0.5736 0.5015 0.7081
No log 80.0 480 0.4955 0.6401 0.4955 0.7039
No log 80.3333 482 0.4884 0.5784 0.4884 0.6988
No log 80.6667 484 0.4905 0.5784 0.4905 0.7003
No log 81.0 486 0.4929 0.5550 0.4929 0.7021
No log 81.3333 488 0.4925 0.5550 0.4925 0.7018
No log 81.6667 490 0.4881 0.5567 0.4881 0.6987
No log 82.0 492 0.4824 0.5567 0.4824 0.6945
No log 82.3333 494 0.4747 0.5899 0.4747 0.6890
No log 82.6667 496 0.4694 0.5899 0.4694 0.6851
No log 83.0 498 0.4675 0.6125 0.4675 0.6837
0.2204 83.3333 500 0.4685 0.6125 0.4685 0.6845
0.2204 83.6667 502 0.4704 0.5899 0.4704 0.6859
0.2204 84.0 504 0.4733 0.5665 0.4733 0.6880
0.2204 84.3333 506 0.4770 0.5665 0.4770 0.6906
0.2204 84.6667 508 0.4794 0.5567 0.4794 0.6924
0.2204 85.0 510 0.4843 0.5567 0.4843 0.6959

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run3_AugV5_k1_task7_organization

Finetuned
(4019)
this model