ArabicNewSplits7_FineTuningAraBERT_run3_AugV5_k5_task7_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified. It achieves the following results on the evaluation set:

  • Loss: 0.4011
  • Qwk: 0.6979
  • Mse: 0.4011
  • Rmse: 0.6334
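
The card does not say how these metrics were computed. The sketch below is one plausible reading, not the author's evaluation code: it assumes that Qwk is Cohen's kappa with quadratic weights over integer labels, and that Mse and Rmse are the usual mean squared error and its square root. Because Loss equals Mse, the training objective was probably a mean squared error regression loss, but that is also an assumption. The function names and the `num_labels` parameter are hypothetical.

```python
import math


def quadratic_weighted_kappa(y_true, y_pred, num_labels):
    """Cohen's kappa with quadratic weights for integer labels 0..num_labels-1."""
    n = len(y_true)
    # Observed confusion matrix: observed[t][p] counts (true label, predicted label) pairs.
    observed = [[0.0] * num_labels for _ in range(num_labels)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1.0
    row_totals = [sum(observed[i]) for i in range(num_labels)]
    col_totals = [sum(observed[i][j] for i in range(num_labels)) for j in range(num_labels)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(num_labels):
        for j in range(num_labels):
            # Quadratic penalty: larger label distance costs quadratically more.
            weight = (i - j) ** 2 / (num_labels - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of the two label distributions.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n
    return 1.0 - weighted_observed / weighted_expected


def mse(y_true, y_pred):
    """Mean squared error between two equally long sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(mse(y_true, y_pred))
```

Under this reading, the reported values are consistent with each other: the square root of 0.4011 is about 0.6333, which matches the reported Rmse of 0.6334 up to rounding of the underlying loss.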

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
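
To make the linear schedule concrete, here is a minimal sketch of how the learning rate would decay under these settings. Two values are assumptions, not facts from the card: 15 optimizer steps per epoch (inferred from the training log, where step 2 corresponds to epoch 0.1333) and zero warmup steps (no warmup is listed). The function `linear_lr` is a hypothetical helper, not part of any library.

```python
BASE_LR = 2e-05        # learning_rate from the card
NUM_EPOCHS = 100       # num_epochs from the card
STEPS_PER_EPOCH = 15   # assumption: inferred from the log (step 2 ~ epoch 0.1333)
WARMUP_STEPS = 0       # assumption: the card lists no warmup

TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step):
    """Learning rate at a given optimizer step under linear warmup and linear decay."""
    if step < WARMUP_STEPS:
        # Ramp up linearly from 0 to BASE_LR during warmup.
        return BASE_LR * step / max(1, WARMUP_STEPS)
    # Decay linearly from BASE_LR to 0 over the remaining steps.
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    for step in (0, 512, 750, TOTAL_STEPS):
        print(step, linear_lr(step))
```

Under these assumptions the schedule spans 1500 steps, so the last logged step (512) would still be in the first half of the decay.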

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1333 2 2.5900 -0.0593 2.5900 1.6094
No log 0.2667 4 1.1749 0.0993 1.1749 1.0839
No log 0.4 6 0.7766 0.0937 0.7766 0.8812
No log 0.5333 8 0.7910 0.0608 0.7910 0.8894
No log 0.6667 10 0.9163 0.2939 0.9163 0.9572
No log 0.8 12 0.7580 0.2467 0.7580 0.8706
No log 0.9333 14 0.7363 0.2063 0.7363 0.8581
No log 1.0667 16 0.9297 0.1288 0.9297 0.9642
No log 1.2 18 0.7754 0.2103 0.7754 0.8806
No log 1.3333 20 0.6582 0.1903 0.6582 0.8113
No log 1.4667 22 0.6466 0.3123 0.6466 0.8041
No log 1.6 24 0.6362 0.3494 0.6362 0.7976
No log 1.7333 26 0.6149 0.3274 0.6149 0.7841
No log 1.8667 28 0.6052 0.3354 0.6052 0.7780
No log 2.0 30 0.5955 0.2412 0.5955 0.7717
No log 2.1333 32 0.5959 0.2345 0.5959 0.7719
No log 2.2667 34 0.5791 0.2851 0.5791 0.7610
No log 2.4 36 0.5829 0.3640 0.5829 0.7635
No log 2.5333 38 0.5087 0.4561 0.5087 0.7132
No log 2.6667 40 0.4745 0.5227 0.4745 0.6888
No log 2.8 42 0.6166 0.4315 0.6166 0.7852
No log 2.9333 44 0.7377 0.4667 0.7377 0.8589
No log 3.0667 46 0.5465 0.4681 0.5465 0.7393
No log 3.2 48 0.5284 0.6206 0.5284 0.7269
No log 3.3333 50 0.7271 0.4667 0.7271 0.8527
No log 3.4667 52 0.5653 0.5664 0.5653 0.7518
No log 3.6 54 0.5174 0.4966 0.5174 0.7193
No log 3.7333 56 0.6063 0.4982 0.6063 0.7787
No log 3.8667 58 0.4958 0.5288 0.4958 0.7042
No log 4.0 60 0.6391 0.5160 0.6391 0.7994
No log 4.1333 62 0.8175 0.4568 0.8175 0.9042
No log 4.2667 64 0.6750 0.4977 0.6750 0.8216
No log 4.4 66 0.4779 0.6317 0.4779 0.6913
No log 4.5333 68 0.6323 0.5215 0.6323 0.7952
No log 4.6667 70 0.8169 0.4511 0.8169 0.9038
No log 4.8 72 0.6603 0.4648 0.6603 0.8126
No log 4.9333 74 0.4626 0.6032 0.4626 0.6802
No log 5.0667 76 0.6003 0.5489 0.6003 0.7748
No log 5.2 78 0.6730 0.5093 0.6730 0.8204
No log 5.3333 80 0.5690 0.5595 0.5690 0.7544
No log 5.4667 82 0.4984 0.5559 0.4984 0.7060
No log 5.6 84 0.8386 0.4953 0.8386 0.9158
No log 5.7333 86 0.9554 0.4670 0.9554 0.9775
No log 5.8667 88 0.7667 0.4844 0.7667 0.8756
No log 6.0 90 0.5351 0.5770 0.5351 0.7315
No log 6.1333 92 0.5217 0.6677 0.5217 0.7223
No log 6.2667 94 0.5358 0.6773 0.5358 0.7320
No log 6.4 96 0.5136 0.5874 0.5136 0.7167
No log 6.5333 98 0.5468 0.5341 0.5468 0.7395
No log 6.6667 100 0.6543 0.4805 0.6543 0.8089
No log 6.8 102 0.6473 0.4805 0.6473 0.8045
No log 6.9333 104 0.6289 0.4610 0.6289 0.7931
No log 7.0667 106 0.6202 0.4385 0.6202 0.7875
No log 7.2 108 0.5221 0.5015 0.5221 0.7225
No log 7.3333 110 0.5037 0.5324 0.5037 0.7097
No log 7.4667 112 0.4985 0.5324 0.4985 0.7060
No log 7.6 114 0.4617 0.5988 0.4617 0.6795
No log 7.7333 116 0.4479 0.6254 0.4479 0.6692
No log 7.8667 118 0.4549 0.6101 0.4549 0.6744
No log 8.0 120 0.4573 0.6101 0.4573 0.6762
No log 8.1333 122 0.4264 0.6142 0.4264 0.6530
No log 8.2667 124 0.5073 0.5677 0.5073 0.7123
No log 8.4 126 0.5225 0.5497 0.5225 0.7229
No log 8.5333 128 0.4386 0.6004 0.4386 0.6623
No log 8.6667 130 0.4203 0.6655 0.4203 0.6483
No log 8.8 132 0.4230 0.6957 0.4230 0.6504
No log 8.9333 134 0.4250 0.6863 0.4250 0.6519
No log 9.0667 136 0.4173 0.6060 0.4173 0.6460
No log 9.2 138 0.4440 0.6214 0.4440 0.6663
No log 9.3333 140 0.4620 0.6214 0.4620 0.6797
No log 9.4667 142 0.4820 0.5983 0.4820 0.6942
No log 9.6 144 0.5331 0.5418 0.5331 0.7301
No log 9.7333 146 0.6064 0.5614 0.6064 0.7787
No log 9.8667 148 0.6337 0.6000 0.6337 0.7961
No log 10.0 150 0.4586 0.5961 0.4586 0.6772
No log 10.1333 152 0.4219 0.6750 0.4219 0.6495
No log 10.2667 154 0.4242 0.6197 0.4242 0.6513
No log 10.4 156 0.4261 0.6197 0.4261 0.6528
No log 10.5333 158 0.4245 0.6007 0.4245 0.6516
No log 10.6667 160 0.4566 0.5682 0.4566 0.6757
No log 10.8 162 0.4796 0.5831 0.4796 0.6925
No log 10.9333 164 0.4655 0.6408 0.4655 0.6823
No log 11.0667 166 0.4804 0.5875 0.4804 0.6931
No log 11.2 168 0.4771 0.5875 0.4771 0.6907
No log 11.3333 170 0.4563 0.6530 0.4563 0.6755
No log 11.4667 172 0.4549 0.6158 0.4549 0.6745
No log 11.6 174 0.5564 0.5763 0.5564 0.7459
No log 11.7333 176 0.5986 0.5813 0.5986 0.7737
No log 11.8667 178 0.5064 0.6110 0.5064 0.7116
No log 12.0 180 0.4445 0.5840 0.4445 0.6667
No log 12.1333 182 0.4560 0.6709 0.4560 0.6753
No log 12.2667 184 0.4519 0.6526 0.4519 0.6722
No log 12.4 186 0.4397 0.6310 0.4397 0.6631
No log 12.5333 188 0.5274 0.5323 0.5274 0.7262
No log 12.6667 190 0.6100 0.5738 0.6100 0.7810
No log 12.8 192 0.5346 0.5170 0.5346 0.7312
No log 12.9333 194 0.4397 0.5988 0.4397 0.6631
No log 13.0667 196 0.4459 0.6282 0.4459 0.6678
No log 13.2 198 0.4369 0.6464 0.4369 0.6610
No log 13.3333 200 0.4576 0.5692 0.4576 0.6765
No log 13.4667 202 0.4852 0.5468 0.4852 0.6966
No log 13.6 204 0.4550 0.5897 0.4550 0.6745
No log 13.7333 206 0.4527 0.6078 0.4527 0.6728
No log 13.8667 208 0.4751 0.6232 0.4751 0.6892
No log 14.0 210 0.4853 0.5989 0.4853 0.6966
No log 14.1333 212 0.4767 0.6235 0.4767 0.6904
No log 14.2667 214 0.4885 0.6419 0.4885 0.6989
No log 14.4 216 0.4609 0.6087 0.4609 0.6789
No log 14.5333 218 0.4658 0.5609 0.4658 0.6825
No log 14.6667 220 0.4691 0.5687 0.4691 0.6849
No log 14.8 222 0.4725 0.5687 0.4725 0.6874
No log 14.9333 224 0.4984 0.5560 0.4984 0.7060
No log 15.0667 226 0.5846 0.4821 0.5846 0.7646
No log 15.2 228 0.6218 0.5351 0.6218 0.7885
No log 15.3333 230 0.5455 0.5625 0.5455 0.7386
No log 15.4667 232 0.4975 0.5184 0.4975 0.7053
No log 15.6 234 0.4956 0.5160 0.4956 0.7040
No log 15.7333 236 0.4843 0.5269 0.4843 0.6959
No log 15.8667 238 0.4912 0.5723 0.4912 0.7008
No log 16.0 240 0.5328 0.4911 0.5328 0.7300
No log 16.1333 242 0.4932 0.5086 0.4932 0.7023
No log 16.2667 244 0.4599 0.6214 0.4599 0.6782
No log 16.4 246 0.4562 0.5373 0.4562 0.6754
No log 16.5333 248 0.4526 0.5373 0.4526 0.6728
No log 16.6667 250 0.4540 0.6032 0.4540 0.6738
No log 16.8 252 0.4827 0.5801 0.4827 0.6948
No log 16.9333 254 0.4973 0.5801 0.4973 0.7052
No log 17.0667 256 0.4659 0.6032 0.4659 0.6826
No log 17.2 258 0.4523 0.6371 0.4523 0.6725
No log 17.3333 260 0.4680 0.6445 0.4680 0.6841
No log 17.4667 262 0.4688 0.6445 0.4688 0.6847
No log 17.6 264 0.4549 0.6267 0.4549 0.6744
No log 17.7333 266 0.5102 0.5511 0.5102 0.7143
No log 17.8667 268 0.6015 0.5750 0.6015 0.7755
No log 18.0 270 0.5854 0.5206 0.5854 0.7651
No log 18.1333 272 0.4987 0.5468 0.4987 0.7062
No log 18.2667 274 0.4452 0.5941 0.4452 0.6672
No log 18.4 276 0.4604 0.5970 0.4604 0.6785
No log 18.5333 278 0.4719 0.5904 0.4719 0.6869
No log 18.6667 280 0.4530 0.5970 0.4530 0.6731
No log 18.8 282 0.4443 0.6455 0.4443 0.6666
No log 18.9333 284 0.4535 0.6371 0.4535 0.6734
No log 19.0667 286 0.4625 0.6747 0.4625 0.6801
No log 19.2 288 0.4730 0.7067 0.4730 0.6878
No log 19.3333 290 0.4657 0.6580 0.4657 0.6824
No log 19.4667 292 0.4513 0.6561 0.4513 0.6718
No log 19.6 294 0.4374 0.6939 0.4374 0.6614
No log 19.7333 296 0.4400 0.6210 0.4400 0.6634
No log 19.8667 298 0.4362 0.6295 0.4362 0.6605
No log 20.0 300 0.4295 0.6854 0.4295 0.6553
No log 20.1333 302 0.4302 0.6542 0.4302 0.6559
No log 20.2667 304 0.4372 0.6816 0.4372 0.6612
No log 20.4 306 0.4453 0.6747 0.4453 0.6673
No log 20.5333 308 0.4767 0.6793 0.4767 0.6904
No log 20.6667 310 0.4810 0.6195 0.4810 0.6936
No log 20.8 312 0.4682 0.6749 0.4682 0.6843
No log 20.9333 314 0.4607 0.6590 0.4607 0.6788
No log 21.0667 316 0.4600 0.6572 0.4600 0.6783
No log 21.2 318 0.4583 0.5796 0.4583 0.6770
No log 21.3333 320 0.4620 0.5867 0.4620 0.6797
No log 21.4667 322 0.4616 0.5812 0.4616 0.6794
No log 21.6 324 0.4677 0.5812 0.4677 0.6839
No log 21.7333 326 0.4820 0.6353 0.4820 0.6943
No log 21.8667 328 0.4965 0.5396 0.4965 0.7046
No log 22.0 330 0.4853 0.6434 0.4853 0.6966
No log 22.1333 332 0.4696 0.5687 0.4696 0.6853
No log 22.2667 334 0.4855 0.5495 0.4855 0.6968
No log 22.4 336 0.4815 0.5266 0.4815 0.6939
No log 22.5333 338 0.4720 0.6010 0.4720 0.6870
No log 22.6667 340 0.4843 0.5980 0.4843 0.6959
No log 22.8 342 0.4683 0.5912 0.4683 0.6843
No log 22.9333 344 0.4646 0.5750 0.4646 0.6816
No log 23.0667 346 0.4661 0.5782 0.4661 0.6827
No log 23.2 348 0.4851 0.5933 0.4851 0.6965
No log 23.3333 350 0.5717 0.5484 0.5717 0.7561
No log 23.4667 352 0.5733 0.5582 0.5733 0.7572
No log 23.6 354 0.4975 0.6010 0.4975 0.7054
No log 23.7333 356 0.4709 0.5413 0.4709 0.6862
No log 23.8667 358 0.4711 0.6084 0.4711 0.6864
No log 24.0 360 0.4543 0.5987 0.4543 0.6740
No log 24.1333 362 0.4495 0.6298 0.4495 0.6705
No log 24.2667 364 0.4502 0.6346 0.4502 0.6710
No log 24.4 366 0.4600 0.5845 0.4600 0.6783
No log 24.5333 368 0.4447 0.6200 0.4447 0.6669
No log 24.6667 370 0.4374 0.5765 0.4374 0.6614
No log 24.8 372 0.4494 0.5495 0.4494 0.6704
No log 24.9333 374 0.4636 0.5779 0.4636 0.6809
No log 25.0667 376 0.4642 0.5779 0.4642 0.6813
No log 25.2 378 0.4371 0.6156 0.4371 0.6612
No log 25.3333 380 0.4379 0.5853 0.4379 0.6617
No log 25.4667 382 0.4705 0.5877 0.4705 0.6859
No log 25.6 384 0.4868 0.6282 0.4868 0.6977
No log 25.7333 386 0.4605 0.6210 0.4605 0.6786
No log 25.8667 388 0.4374 0.6014 0.4374 0.6613
No log 26.0 390 0.4351 0.5634 0.4351 0.6596
No log 26.1333 392 0.4430 0.5457 0.4430 0.6656
No log 26.2667 394 0.4622 0.5367 0.4622 0.6799
No log 26.4 396 0.4746 0.5693 0.4746 0.6889
No log 26.5333 398 0.4939 0.5528 0.4939 0.7028
No log 26.6667 400 0.5018 0.5758 0.5018 0.7083
No log 26.8 402 0.4799 0.5897 0.4799 0.6928
No log 26.9333 404 0.4537 0.6552 0.4537 0.6736
No log 27.0667 406 0.4472 0.6750 0.4472 0.6687
No log 27.2 408 0.4456 0.6661 0.4456 0.6675
No log 27.3333 410 0.4452 0.6667 0.4452 0.6672
No log 27.4667 412 0.4444 0.6667 0.4444 0.6666
No log 27.6 414 0.4482 0.6555 0.4482 0.6694
No log 27.7333 416 0.4491 0.6555 0.4491 0.6701
No log 27.8667 418 0.4499 0.6555 0.4499 0.6707
No log 28.0 420 0.4503 0.6555 0.4503 0.6711
No log 28.1333 422 0.4541 0.6555 0.4541 0.6739
No log 28.2667 424 0.4591 0.6344 0.4591 0.6776
No log 28.4 426 0.4764 0.5897 0.4764 0.6902
No log 28.5333 428 0.5039 0.5673 0.5039 0.7098
No log 28.6667 430 0.4979 0.5601 0.4979 0.7056
No log 28.8 432 0.4648 0.5995 0.4648 0.6817
No log 28.9333 434 0.4480 0.6091 0.4480 0.6693
No log 29.0667 436 0.4539 0.5571 0.4539 0.6737
No log 29.2 438 0.4574 0.5702 0.4574 0.6763
No log 29.3333 440 0.4584 0.6170 0.4584 0.6771
No log 29.4667 442 0.4460 0.5781 0.4460 0.6679
No log 29.6 444 0.4351 0.6197 0.4351 0.6596
No log 29.7333 446 0.4321 0.6185 0.4321 0.6573
No log 29.8667 448 0.4263 0.6171 0.4263 0.6529
No log 30.0 450 0.4333 0.6517 0.4333 0.6583
No log 30.1333 452 0.4367 0.6517 0.4367 0.6608
No log 30.2667 454 0.4330 0.6628 0.4330 0.6580
No log 30.4 456 0.4300 0.6435 0.4300 0.6558
No log 30.5333 458 0.4254 0.6435 0.4254 0.6522
No log 30.6667 460 0.4265 0.6648 0.4265 0.6531
No log 30.8 462 0.4296 0.6269 0.4296 0.6554
No log 30.9333 464 0.4245 0.6183 0.4245 0.6515
No log 31.0667 466 0.4234 0.6101 0.4234 0.6507
No log 31.2 468 0.4199 0.6183 0.4199 0.6480
No log 31.3333 470 0.4124 0.6464 0.4124 0.6422
No log 31.4667 472 0.4096 0.6464 0.4096 0.6400
No log 31.6 474 0.4202 0.6639 0.4202 0.6482
No log 31.7333 476 0.4247 0.6634 0.4247 0.6517
No log 31.8667 478 0.4188 0.6467 0.4188 0.6471
No log 32.0 480 0.4298 0.6020 0.4298 0.6556
No log 32.1333 482 0.4597 0.5642 0.4597 0.6780
No log 32.2667 484 0.4680 0.5438 0.4680 0.6841
No log 32.4 486 0.4420 0.5406 0.4420 0.6648
No log 32.5333 488 0.4228 0.5831 0.4228 0.6502
No log 32.6667 490 0.4439 0.6716 0.4439 0.6663
No log 32.8 492 0.4714 0.6767 0.4714 0.6866
No log 32.9333 494 0.5053 0.6110 0.5053 0.7109
No log 33.0667 496 0.5126 0.6060 0.5126 0.7160
No log 33.2 498 0.4702 0.5929 0.4702 0.6857
0.271 33.3333 500 0.4259 0.7198 0.4259 0.6526
0.271 33.4667 502 0.4398 0.7284 0.4398 0.6632
0.271 33.6 504 0.4743 0.6729 0.4743 0.6887
0.271 33.7333 506 0.4730 0.6729 0.4730 0.6878
0.271 33.8667 508 0.4430 0.6361 0.4430 0.6655
0.271 34.0 510 0.4130 0.6932 0.4130 0.6426
0.271 34.1333 512 0.4011 0.6979 0.4011 0.6334
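
The log above is whitespace-separated, with the first column reading either "No log" or a number. The sketch below shows one way to parse such rows and pick the best evaluation step. It is an illustration only: `parse_row` and the dictionary keys are hypothetical names, and the sample contains just four rows copied from the table.

```python
def parse_row(line):
    """Parse one log row into a dict; 'No log' training loss becomes None."""
    tokens = line.split()
    # The last six tokens are always numeric; whatever precedes them is the training loss.
    epoch, step, val_loss, qwk, mse, rmse = tokens[-6:]
    train_loss_text = " ".join(tokens[:-6])
    train_loss = None if train_loss_text == "No log" else float(train_loss_text)
    return {
        "train_loss": train_loss,
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


# A few rows copied verbatim from the table above.
SAMPLE_ROWS = [
    "No log 0.1333 2 2.5900 -0.0593 2.5900 1.6094",
    "0.271 33.3333 500 0.4259 0.7198 0.4259 0.6526",
    "0.271 33.4667 502 0.4398 0.7284 0.4398 0.6632",
    "0.271 34.1333 512 0.4011 0.6979 0.4011 0.6334",
]

rows = [parse_row(line) for line in SAMPLE_ROWS]
best_by_loss = min(rows, key=lambda r: r["val_loss"])
best_by_qwk = max(rows, key=lambda r: r["qwk"])

if __name__ == "__main__":
    print("lowest validation loss at step", best_by_loss["step"])
    print("highest Qwk at step", best_by_qwk["step"])
```

In this sample the two criteria disagree: validation loss is lowest at step 512, the row reported at the top of the card, while Qwk is highest at step 502. Which criterion selected the reported checkpoint is not stated in the card.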

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size: 0.1B params (Safetensors, tensor type F32)