ArabicNewSplits6_FineTuningAraBERTFreeze_run1_AugV5_k7_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified. It achieves the following results on the evaluation set:

  • Loss: 0.7821
  • Qwk: 0.4838
  • Mse: 0.7821
  • Rmse: 0.8844
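
The reported Loss and Mse are identical, which would be consistent with a mean-squared-error objective, and Rmse is the square root of Mse. Qwk presumably stands for quadratic weighted kappa. The sketch below shows how these quantities relate. It is an illustration under assumptions, not the evaluation code used for this model: the function name, the integer rating scale, and the toy labels are all hypothetical, and the card does not say how continuous predictions were mapped to integer ratings.

```python
import math


def quadratic_weighted_kappa(y_true, y_pred, min_rating, max_rating):
    """Quadratic weighted kappa for integer ratings (hypothetical helper).

    Assumes at least two rating levels and that the ratings are not all
    identical, otherwise the denominator below is zero.
    """
    n = max_rating - min_rating + 1
    # Confusion matrix: rows are true ratings, columns are predicted ratings.
    observed = [[0] * n for _ in range(n)]
    for t, p in zip(y_true, y_pred):
        observed[t - min_rating][p - min_rating] += 1

    total = len(y_true)
    hist_true = [sum(row) for row in observed]
    hist_pred = [sum(observed[i][j] for i in range(n)) for j in range(n)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2 / (n - 1) ** 2  # quadratic disagreement penalty
            expected = hist_true[i] * hist_pred[j] / total
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * expected
    return 1.0 - weighted_observed / weighted_expected


# Rmse follows from Mse: the reported 0.7821 gives the reported 0.8844.
rmse_from_mse = round(math.sqrt(0.7821), 4)

# Toy labels (made up for illustration): one of four predictions is off by one.
toy_kappa = quadratic_weighted_kappa([0, 1, 2, 2], [0, 1, 1, 2], 0, 2)
```

A kappa of 1.0 means perfect agreement and 0 means chance-level agreement, so the reported 0.4838 sits roughly midway between the two.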

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
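
As a rough illustration of what the linear scheduler implies, the sketch below computes the learning rate at a given optimizer step. It makes several assumptions the card does not confirm: no warmup steps (none are listed), and 18 optimizer steps per epoch, a figure read off the results table, where step 18 corresponds to epoch 1.0. The function name and constants are hypothetical.

```python
BASE_LR = 2e-5          # learning_rate from the list above
STEPS_PER_EPOCH = 18    # assumption: inferred from the results table
NUM_EPOCHS = 100        # num_epochs from the list above
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(step, base_lr=BASE_LR, total_steps=TOTAL_STEPS, warmup_steps=0):
    """Linear warmup followed by linear decay to zero (hypothetical helper)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the last logged step (510) under these assumptions.
lr_at_last_logged_step = linear_lr(510)
```

Under these assumptions the schedule spans 1800 steps, so at step 510 the learning rate would still be about 1.43e-05, roughly 72% of its initial value.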

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1111 2 6.4415 -0.0278 6.4415 2.5380
No log 0.2222 4 4.2964 -0.0299 4.2964 2.0728
No log 0.3333 6 2.8983 -0.0143 2.8983 1.7024
No log 0.4444 8 1.9429 0.0202 1.9429 1.3939
No log 0.5556 10 1.3324 0.0592 1.3324 1.1543
No log 0.6667 12 1.0152 0.0171 1.0152 1.0076
No log 0.7778 14 1.0439 0.0066 1.0439 1.0217
No log 0.8889 16 1.0205 -0.0289 1.0205 1.0102
No log 1.0 18 0.8632 0.0258 0.8632 0.9291
No log 1.1111 20 0.7558 0.1323 0.7558 0.8694
No log 1.2222 22 0.7082 0.2023 0.7082 0.8416
No log 1.3333 24 0.6901 0.2990 0.6901 0.8307
No log 1.4444 26 0.7006 0.2981 0.7006 0.8370
No log 1.5556 28 0.7464 0.1800 0.7464 0.8639
No log 1.6667 30 0.8080 0.1493 0.8080 0.8989
No log 1.7778 32 0.7994 0.2130 0.7994 0.8941
No log 1.8889 34 0.7356 0.2374 0.7356 0.8577
No log 2.0 36 0.6706 0.2917 0.6706 0.8189
No log 2.1111 38 0.6363 0.3347 0.6363 0.7977
No log 2.2222 40 0.6210 0.3347 0.6210 0.7880
No log 2.3333 42 0.6210 0.3260 0.6210 0.7880
No log 2.4444 44 0.6398 0.3162 0.6398 0.7998
No log 2.5556 46 0.6442 0.3284 0.6442 0.8026
No log 2.6667 48 0.6340 0.3385 0.6340 0.7963
No log 2.7778 50 0.6272 0.3759 0.6272 0.7920
No log 2.8889 52 0.6252 0.3759 0.6252 0.7907
No log 3.0 54 0.6119 0.3845 0.6119 0.7822
No log 3.1111 56 0.5955 0.3947 0.5955 0.7717
No log 3.2222 58 0.6029 0.3434 0.6029 0.7765
No log 3.3333 60 0.6286 0.3386 0.6286 0.7928
No log 3.4444 62 0.6481 0.3224 0.6481 0.8051
No log 3.5556 64 0.6201 0.3666 0.6201 0.7875
No log 3.6667 66 0.6035 0.3832 0.6035 0.7768
No log 3.7778 68 0.5660 0.4079 0.5660 0.7523
No log 3.8889 70 0.5524 0.3821 0.5524 0.7432
No log 4.0 72 0.5507 0.3805 0.5507 0.7421
No log 4.1111 74 0.5496 0.3860 0.5496 0.7414
No log 4.2222 76 0.5530 0.4226 0.5530 0.7437
No log 4.3333 78 0.5611 0.4440 0.5611 0.7491
No log 4.4444 80 0.5635 0.4507 0.5635 0.7507
No log 4.5556 82 0.5803 0.5027 0.5803 0.7618
No log 4.6667 84 0.6136 0.5283 0.6136 0.7833
No log 4.7778 86 0.6552 0.4715 0.6552 0.8095
No log 4.8889 88 0.7247 0.4419 0.7247 0.8513
No log 5.0 90 0.8251 0.4029 0.8251 0.9083
No log 5.1111 92 0.7314 0.4380 0.7314 0.8552
No log 5.2222 94 0.5919 0.5202 0.5919 0.7694
No log 5.3333 96 0.5491 0.4974 0.5491 0.7410
No log 5.4444 98 0.5443 0.4793 0.5443 0.7378
No log 5.5556 100 0.5444 0.5147 0.5444 0.7378
No log 5.6667 102 0.5493 0.4985 0.5493 0.7412
No log 5.7778 104 0.5641 0.4706 0.5641 0.7511
No log 5.8889 106 0.5562 0.4944 0.5562 0.7458
No log 6.0 108 0.5525 0.5330 0.5525 0.7433
No log 6.1111 110 0.5543 0.5330 0.5543 0.7445
No log 6.2222 112 0.5570 0.5277 0.5570 0.7463
No log 6.3333 114 0.5503 0.5184 0.5503 0.7418
No log 6.4444 116 0.5860 0.5234 0.5860 0.7655
No log 6.5556 118 0.6057 0.4968 0.6057 0.7783
No log 6.6667 120 0.5787 0.4865 0.5787 0.7607
No log 6.7778 122 0.6067 0.5146 0.6067 0.7789
No log 6.8889 124 0.6410 0.4843 0.6410 0.8006
No log 7.0 126 0.6426 0.4941 0.6426 0.8016
No log 7.1111 128 0.6239 0.5532 0.6239 0.7899
No log 7.2222 130 0.6270 0.5477 0.6270 0.7918
No log 7.3333 132 0.6893 0.5387 0.6893 0.8303
No log 7.4444 134 0.8254 0.4980 0.8254 0.9085
No log 7.5556 136 0.8907 0.4728 0.8907 0.9438
No log 7.6667 138 0.8394 0.4863 0.8394 0.9162
No log 7.7778 140 0.7406 0.5180 0.7406 0.8606
No log 7.8889 142 0.6811 0.5265 0.6811 0.8253
No log 8.0 144 0.6841 0.5046 0.6841 0.8271
No log 8.1111 146 0.7304 0.4947 0.7304 0.8546
No log 8.2222 148 0.8207 0.4844 0.8207 0.9059
No log 8.3333 150 0.7565 0.4904 0.7565 0.8698
No log 8.4444 152 0.6571 0.5584 0.6571 0.8106
No log 8.5556 154 0.6191 0.4956 0.6191 0.7868
No log 8.6667 156 0.6316 0.5108 0.6316 0.7947
No log 8.7778 158 0.6217 0.5142 0.6217 0.7885
No log 8.8889 160 0.6054 0.5583 0.6054 0.7781
No log 9.0 162 0.6165 0.5752 0.6165 0.7852
No log 9.1111 164 0.6409 0.5694 0.6409 0.8006
No log 9.2222 166 0.6418 0.5796 0.6418 0.8011
No log 9.3333 168 0.6480 0.5467 0.6480 0.8050
No log 9.4444 170 0.6491 0.5448 0.6491 0.8057
No log 9.5556 172 0.6595 0.5331 0.6595 0.8121
No log 9.6667 174 0.6739 0.5203 0.6739 0.8209
No log 9.7778 176 0.6818 0.5037 0.6818 0.8257
No log 9.8889 178 0.6995 0.5208 0.6995 0.8364
No log 10.0 180 0.7113 0.5286 0.7113 0.8434
No log 10.1111 182 0.7190 0.5016 0.7190 0.8479
No log 10.2222 184 0.7063 0.5174 0.7063 0.8404
No log 10.3333 186 0.6896 0.5187 0.6896 0.8304
No log 10.4444 188 0.6896 0.4674 0.6896 0.8304
No log 10.5556 190 0.6978 0.4755 0.6978 0.8353
No log 10.6667 192 0.7077 0.4862 0.7077 0.8413
No log 10.7778 194 0.7677 0.4950 0.7677 0.8762
No log 10.8889 196 0.9264 0.4650 0.9264 0.9625
No log 11.0 198 0.9945 0.4250 0.9945 0.9973
No log 11.1111 200 0.8874 0.4454 0.8874 0.9420
No log 11.2222 202 0.7457 0.5430 0.7457 0.8635
No log 11.3333 204 0.7144 0.4890 0.7144 0.8452
No log 11.4444 206 0.7238 0.4831 0.7238 0.8508
No log 11.5556 208 0.7312 0.4873 0.7312 0.8551
No log 11.6667 210 0.7425 0.4898 0.7425 0.8617
No log 11.7778 212 0.7613 0.5386 0.7613 0.8725
No log 11.8889 214 0.7994 0.5070 0.7994 0.8941
No log 12.0 216 0.7951 0.4995 0.7951 0.8917
No log 12.1111 218 0.7754 0.4977 0.7754 0.8805
No log 12.2222 220 0.7646 0.5315 0.7646 0.8744
No log 12.3333 222 0.7668 0.5309 0.7668 0.8757
No log 12.4444 224 0.7681 0.5126 0.7681 0.8764
No log 12.5556 226 0.7853 0.5503 0.7853 0.8862
No log 12.6667 228 0.7940 0.5223 0.7940 0.8911
No log 12.7778 230 0.7771 0.5304 0.7771 0.8815
No log 12.8889 232 0.7739 0.4853 0.7739 0.8797
No log 13.0 234 0.7824 0.5148 0.7824 0.8845
No log 13.1111 236 0.7818 0.5037 0.7818 0.8842
No log 13.2222 238 0.7664 0.5069 0.7664 0.8754
No log 13.3333 240 0.7671 0.5021 0.7671 0.8758
No log 13.4444 242 0.7694 0.5013 0.7694 0.8772
No log 13.5556 244 0.7646 0.4876 0.7646 0.8744
No log 13.6667 246 0.7644 0.4948 0.7644 0.8743
No log 13.7778 248 0.7862 0.4786 0.7862 0.8867
No log 13.8889 250 0.8297 0.4789 0.8297 0.9109
No log 14.0 252 0.8491 0.4732 0.8491 0.9215
No log 14.1111 254 0.8019 0.5053 0.8019 0.8955
No log 14.2222 256 0.7636 0.4919 0.7636 0.8739
No log 14.3333 258 0.7777 0.4545 0.7777 0.8819
No log 14.4444 260 0.8102 0.4762 0.8102 0.9001
No log 14.5556 262 0.8014 0.4690 0.8014 0.8952
No log 14.6667 264 0.8050 0.4933 0.8050 0.8972
No log 14.7778 266 0.8427 0.4617 0.8427 0.9180
No log 14.8889 268 0.8467 0.4726 0.8467 0.9201
No log 15.0 270 0.8252 0.4921 0.8252 0.9084
No log 15.1111 272 0.8271 0.5008 0.8271 0.9095
No log 15.2222 274 0.8388 0.4992 0.8388 0.9159
No log 15.3333 276 0.8369 0.5138 0.8369 0.9148
No log 15.4444 278 0.8384 0.4866 0.8384 0.9157
No log 15.5556 280 0.8325 0.4668 0.8325 0.9124
No log 15.6667 282 0.8240 0.4975 0.8240 0.9078
No log 15.7778 284 0.8133 0.4680 0.8133 0.9019
No log 15.8889 286 0.8021 0.4746 0.8021 0.8956
No log 16.0 288 0.7978 0.4805 0.7978 0.8932
No log 16.1111 290 0.7943 0.4781 0.7943 0.8913
No log 16.2222 292 0.7982 0.4993 0.7982 0.8934
No log 16.3333 294 0.7901 0.4897 0.7901 0.8889
No log 16.4444 296 0.7774 0.4942 0.7774 0.8817
No log 16.5556 298 0.7770 0.4979 0.7770 0.8815
No log 16.6667 300 0.7920 0.5120 0.7920 0.8899
No log 16.7778 302 0.8144 0.4978 0.8144 0.9024
No log 16.8889 304 0.8422 0.4966 0.8422 0.9177
No log 17.0 306 0.8351 0.4713 0.8351 0.9138
No log 17.1111 308 0.8181 0.5108 0.8181 0.9045
No log 17.2222 310 0.8091 0.4885 0.8091 0.8995
No log 17.3333 312 0.8174 0.4428 0.8174 0.9041
No log 17.4444 314 0.7985 0.4536 0.7985 0.8936
No log 17.5556 316 0.7921 0.4580 0.7921 0.8900
No log 17.6667 318 0.8041 0.5030 0.8041 0.8967
No log 17.7778 320 0.7985 0.4734 0.7985 0.8936
No log 17.8889 322 0.8143 0.4821 0.8143 0.9024
No log 18.0 324 0.8181 0.4815 0.8181 0.9045
No log 18.1111 326 0.8291 0.4847 0.8291 0.9105
No log 18.2222 328 0.8566 0.5151 0.8566 0.9255
No log 18.3333 330 0.8624 0.5132 0.8624 0.9287
No log 18.4444 332 0.8747 0.4616 0.8747 0.9352
No log 18.5556 334 0.9253 0.4859 0.9253 0.9619
No log 18.6667 336 0.9139 0.4970 0.9139 0.9560
No log 18.7778 338 0.8646 0.4815 0.8646 0.9298
No log 18.8889 340 0.8288 0.5002 0.8288 0.9104
No log 19.0 342 0.8209 0.5220 0.8209 0.9061
No log 19.1111 344 0.8083 0.4977 0.8083 0.8991
No log 19.2222 346 0.8164 0.5136 0.8164 0.9036
No log 19.3333 348 0.8171 0.5334 0.8171 0.9040
No log 19.4444 350 0.8161 0.4697 0.8161 0.9034
No log 19.5556 352 0.8217 0.5007 0.8217 0.9065
No log 19.6667 354 0.8212 0.5049 0.8212 0.9062
No log 19.7778 356 0.8179 0.4708 0.8179 0.9044
No log 19.8889 358 0.8456 0.5072 0.8456 0.9196
No log 20.0 360 0.8637 0.5139 0.8637 0.9294
No log 20.1111 362 0.8488 0.5085 0.8488 0.9213
No log 20.2222 364 0.8311 0.4708 0.8311 0.9116
No log 20.3333 366 0.8484 0.4715 0.8484 0.9211
No log 20.4444 368 0.8548 0.4715 0.8548 0.9245
No log 20.5556 370 0.8442 0.4639 0.8442 0.9188
No log 20.6667 372 0.8376 0.4735 0.8376 0.9152
No log 20.7778 374 0.8397 0.5351 0.8397 0.9164
No log 20.8889 376 0.8381 0.5303 0.8381 0.9155
No log 21.0 378 0.8257 0.5015 0.8257 0.9087
No log 21.1111 380 0.8153 0.4819 0.8153 0.9029
No log 21.2222 382 0.8210 0.4594 0.8210 0.9061
No log 21.3333 384 0.8223 0.4770 0.8223 0.9068
No log 21.4444 386 0.8215 0.4786 0.8215 0.9064
No log 21.5556 388 0.8123 0.4895 0.8123 0.9013
No log 21.6667 390 0.7911 0.4504 0.7911 0.8894
No log 21.7778 392 0.7862 0.5268 0.7862 0.8867
No log 21.8889 394 0.8062 0.5344 0.8062 0.8979
No log 22.0 396 0.8153 0.5186 0.8153 0.9029
No log 22.1111 398 0.7763 0.4578 0.7763 0.8811
No log 22.2222 400 0.7484 0.4867 0.7484 0.8651
No log 22.3333 402 0.7763 0.4840 0.7763 0.8811
No log 22.4444 404 0.8123 0.4647 0.8123 0.9013
No log 22.5556 406 0.8222 0.4480 0.8222 0.9067
No log 22.6667 408 0.8153 0.5010 0.8153 0.9030
No log 22.7778 410 0.8209 0.5043 0.8209 0.9060
No log 22.8889 412 0.8126 0.4973 0.8126 0.9014
No log 23.0 414 0.8170 0.4661 0.8170 0.9039
No log 23.1111 416 0.8291 0.4603 0.8291 0.9105
No log 23.2222 418 0.8209 0.4591 0.8209 0.9060
No log 23.3333 420 0.8086 0.4611 0.8086 0.8992
No log 23.4444 422 0.8173 0.4608 0.8173 0.9041
No log 23.5556 424 0.8266 0.4863 0.8266 0.9092
No log 23.6667 426 0.8356 0.5007 0.8356 0.9141
No log 23.7778 428 0.8467 0.5090 0.8467 0.9202
No log 23.8889 430 0.8356 0.4940 0.8356 0.9141
No log 24.0 432 0.8315 0.4968 0.8315 0.9118
No log 24.1111 434 0.8317 0.4684 0.8317 0.9120
No log 24.2222 436 0.8345 0.4713 0.8345 0.9135
No log 24.3333 438 0.8387 0.4805 0.8387 0.9158
No log 24.4444 440 0.8100 0.4796 0.8100 0.9000
No log 24.5556 442 0.7950 0.4736 0.7950 0.8916
No log 24.6667 444 0.8028 0.5067 0.8028 0.8960
No log 24.7778 446 0.8011 0.4719 0.8011 0.8950
No log 24.8889 448 0.8000 0.4599 0.8000 0.8944
No log 25.0 450 0.7934 0.4722 0.7934 0.8907
No log 25.1111 452 0.7869 0.4627 0.7869 0.8871
No log 25.2222 454 0.7878 0.4640 0.7878 0.8876
No log 25.3333 456 0.7838 0.4627 0.7838 0.8853
No log 25.4444 458 0.7919 0.4640 0.7919 0.8899
No log 25.5556 460 0.7902 0.4602 0.7902 0.8889
No log 25.6667 462 0.7896 0.4524 0.7896 0.8886
No log 25.7778 464 0.7840 0.4631 0.7840 0.8855
No log 25.8889 466 0.7858 0.4562 0.7858 0.8865
No log 26.0 468 0.7874 0.4547 0.7874 0.8873
No log 26.1111 470 0.7807 0.4595 0.7807 0.8836
No log 26.2222 472 0.7751 0.4384 0.7751 0.8804
No log 26.3333 474 0.7797 0.4707 0.7797 0.8830
No log 26.4444 476 0.7936 0.4821 0.7936 0.8909
No log 26.5556 478 0.8153 0.4802 0.8153 0.9030
No log 26.6667 480 0.8341 0.4972 0.8341 0.9133
No log 26.7778 482 0.8579 0.5079 0.8579 0.9262
No log 26.8889 484 0.8782 0.4998 0.8782 0.9371
No log 27.0 486 0.8793 0.5058 0.8793 0.9377
No log 27.1111 488 0.8998 0.4534 0.8998 0.9486
No log 27.2222 490 0.9066 0.4578 0.9066 0.9522
No log 27.3333 492 0.8827 0.4578 0.8827 0.9395
No log 27.4444 494 0.8246 0.4694 0.8246 0.9081
No log 27.5556 496 0.8103 0.5065 0.8103 0.9002
No log 27.6667 498 0.8128 0.5183 0.8128 0.9015
0.5031 27.7778 500 0.7896 0.5386 0.7896 0.8886
0.5031 27.8889 502 0.7688 0.5158 0.7688 0.8768
0.5031 28.0 504 0.7534 0.4952 0.7534 0.8680
0.5031 28.1111 506 0.7565 0.5196 0.7565 0.8698
0.5031 28.2222 508 0.7665 0.4944 0.7665 0.8755
0.5031 28.3333 510 0.7821 0.4838 0.7821 0.8844
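
The final row (step 510) matches the evaluation results reported at the top of this card, but it is not the best row in the log: the highest Qwk in the table is 0.5796 at step 166, and the lowest validation loss is 0.5443 at step 98. A minimal sketch for locating such rows is shown below. It parses only a handful of rows copied from the table above, and the names `EvalRow` and `parse_row` are hypothetical.

```python
from typing import NamedTuple


class EvalRow(NamedTuple):
    epoch: float
    step: int
    val_loss: float
    qwk: float
    mse: float
    rmse: float


def parse_row(line):
    """Parse one whitespace-separated log row.

    The training-loss column is either a number or the two words "No log",
    so the six numeric fields are taken from the end of the row.
    """
    epoch, step, val_loss, qwk, mse, rmse = line.split()[-6:]
    return EvalRow(float(epoch), int(step), float(val_loss),
                   float(qwk), float(mse), float(rmse))


# A few rows copied verbatim from the table above.
SAMPLE = """
No log 5.4444 98 0.5443 0.4793 0.5443 0.7378
No log 9.2222 166 0.6418 0.5796 0.6418 0.8011
No log 11.0 198 0.9945 0.4250 0.9945 0.9973
0.5031 28.3333 510 0.7821 0.4838 0.7821 0.8844
"""

rows = [parse_row(line) for line in SAMPLE.strip().splitlines()]
best_by_qwk = max(rows, key=lambda r: r.qwk)
best_by_loss = min(rows, key=lambda r: r.val_loss)
```

Which criterion matters depends on the intended use, which the card does not specify.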

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.1+cu121
  • Datasets 3.2.0
  • Tokenizers 0.19.1

Model size: 0.1B params (Safetensors), tensor type F32