ArabicNewSplits6_FineTuningAraBERTFreeze_run3_AugV5_k3_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.9272
  • Qwk: 0.4300
  • Mse: 0.9272
  • Rmse: 0.9629

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.2222 2 6.3041 -0.0266 6.3041 2.5108
No log 0.4444 4 4.1793 -0.0185 4.1793 2.0443
No log 0.6667 6 2.8721 0.0180 2.8721 1.6947
No log 0.8889 8 2.0804 0.0849 2.0804 1.4424
No log 1.1111 10 1.4737 0.1060 1.4737 1.2140
No log 1.3333 12 1.0277 0.0994 1.0277 1.0137
No log 1.5556 14 0.9234 0.0449 0.9234 0.9609
No log 1.7778 16 0.9072 0.0054 0.9072 0.9524
No log 2.0 18 0.9150 -0.0032 0.9150 0.9565
No log 2.2222 20 0.8488 -0.0065 0.8488 0.9213
No log 2.4444 22 0.7769 0.1856 0.7769 0.8814
No log 2.6667 24 0.7118 0.3300 0.7118 0.8437
No log 2.8889 26 0.6679 0.2693 0.6679 0.8173
No log 3.1111 28 0.6494 0.2650 0.6494 0.8058
No log 3.3333 30 0.6314 0.3010 0.6314 0.7946
No log 3.5556 32 0.6139 0.2918 0.6139 0.7835
No log 3.7778 34 0.5962 0.2950 0.5962 0.7721
No log 4.0 36 0.5866 0.3413 0.5866 0.7659
No log 4.2222 38 0.5909 0.3824 0.5909 0.7687
No log 4.4444 40 0.5846 0.3735 0.5846 0.7646
No log 4.6667 42 0.5781 0.3735 0.5781 0.7604
No log 4.8889 44 0.5774 0.3778 0.5774 0.7599
No log 5.1111 46 0.5678 0.3646 0.5678 0.7535
No log 5.3333 48 0.5659 0.4255 0.5659 0.7523
No log 5.5556 50 0.5641 0.4255 0.5641 0.7510
No log 5.7778 52 0.5789 0.4688 0.5789 0.7609
No log 6.0 54 0.5802 0.4794 0.5802 0.7617
No log 6.2222 56 0.5721 0.4944 0.5721 0.7564
No log 6.4444 58 0.5659 0.5091 0.5659 0.7523
No log 6.6667 60 0.5632 0.5012 0.5632 0.7504
No log 6.8889 62 0.5597 0.5008 0.5597 0.7481
No log 7.1111 64 0.5483 0.4736 0.5483 0.7405
No log 7.3333 66 0.5572 0.4165 0.5572 0.7465
No log 7.5556 68 0.5740 0.3809 0.5740 0.7576
No log 7.7778 70 0.5570 0.4683 0.5570 0.7463
No log 8.0 72 0.5300 0.4673 0.5300 0.7280
No log 8.2222 74 0.5572 0.5011 0.5572 0.7464
No log 8.4444 76 0.5999 0.4703 0.5999 0.7745
No log 8.6667 78 0.6162 0.4822 0.6162 0.7850
No log 8.8889 80 0.5700 0.4728 0.5700 0.7550
No log 9.1111 82 0.5357 0.5159 0.5357 0.7319
No log 9.3333 84 0.5148 0.5417 0.5148 0.7175
No log 9.5556 86 0.5130 0.4847 0.5130 0.7162
No log 9.7778 88 0.5086 0.4837 0.5086 0.7132
No log 10.0 90 0.5171 0.5280 0.5171 0.7191
No log 10.2222 92 0.5546 0.5489 0.5546 0.7447
No log 10.4444 94 0.5747 0.5473 0.5747 0.7581
No log 10.6667 96 0.6089 0.5298 0.6089 0.7803
No log 10.8889 98 0.6638 0.4813 0.6638 0.8148
No log 11.1111 100 0.6800 0.4657 0.6800 0.8246
No log 11.3333 102 0.5976 0.4776 0.5976 0.7731
No log 11.5556 104 0.5288 0.6236 0.5288 0.7272
No log 11.7778 106 0.5260 0.5988 0.5260 0.7253
No log 12.0 108 0.5422 0.5937 0.5422 0.7363
No log 12.2222 110 0.5608 0.6123 0.5608 0.7489
No log 12.4444 112 0.5962 0.5615 0.5962 0.7722
No log 12.6667 114 0.6139 0.5303 0.6139 0.7835
No log 12.8889 116 0.6099 0.5615 0.6099 0.7809
No log 13.1111 118 0.5882 0.5960 0.5882 0.7669
No log 13.3333 120 0.5918 0.5834 0.5918 0.7693
No log 13.5556 122 0.5810 0.6265 0.5810 0.7622
No log 13.7778 124 0.5929 0.5972 0.5929 0.7700
No log 14.0 126 0.5953 0.6028 0.5953 0.7715
No log 14.2222 128 0.6094 0.6253 0.6094 0.7806
No log 14.4444 130 0.6267 0.6099 0.6267 0.7917
No log 14.6667 132 0.6461 0.5370 0.6461 0.8038
No log 14.8889 134 0.6909 0.5445 0.6909 0.8312
No log 15.1111 136 0.7326 0.5089 0.7326 0.8559
No log 15.3333 138 0.6810 0.5347 0.6810 0.8252
No log 15.5556 140 0.6348 0.5824 0.6348 0.7967
No log 15.7778 142 0.6228 0.5939 0.6228 0.7892
No log 16.0 144 0.6350 0.5778 0.6350 0.7969
No log 16.2222 146 0.6567 0.5789 0.6567 0.8104
No log 16.4444 148 0.6441 0.6074 0.6441 0.8026
No log 16.6667 150 0.6562 0.5793 0.6562 0.8101
No log 16.8889 152 0.6877 0.5930 0.6877 0.8293
No log 17.1111 154 0.7018 0.5998 0.7018 0.8377
No log 17.3333 156 0.7221 0.5506 0.7221 0.8498
No log 17.5556 158 0.7441 0.5668 0.7441 0.8626
No log 17.7778 160 0.7431 0.5786 0.7431 0.8620
No log 18.0 162 0.7284 0.5689 0.7284 0.8535
No log 18.2222 164 0.6962 0.5495 0.6962 0.8344
No log 18.4444 166 0.7036 0.5820 0.7036 0.8388
No log 18.6667 168 0.7046 0.5771 0.7046 0.8394
No log 18.8889 170 0.7141 0.5830 0.7141 0.8451
No log 19.1111 172 0.7052 0.5725 0.7052 0.8398
No log 19.3333 174 0.6963 0.5114 0.6963 0.8344
No log 19.5556 176 0.7082 0.5444 0.7082 0.8416
No log 19.7778 178 0.7183 0.5343 0.7183 0.8475
No log 20.0 180 0.7219 0.4944 0.7219 0.8497
No log 20.2222 182 0.7275 0.5563 0.7275 0.8530
No log 20.4444 184 0.7395 0.5577 0.7395 0.8600
No log 20.6667 186 0.7508 0.5273 0.7508 0.8665
No log 20.8889 188 0.7786 0.5154 0.7786 0.8824
No log 21.1111 190 0.7868 0.4634 0.7868 0.8870
No log 21.3333 192 0.7473 0.5379 0.7473 0.8645
No log 21.5556 194 0.7425 0.5519 0.7425 0.8617
No log 21.7778 196 0.7416 0.5670 0.7416 0.8612
No log 22.0 198 0.7179 0.5837 0.7179 0.8473
No log 22.2222 200 0.6959 0.5727 0.6959 0.8342
No log 22.4444 202 0.7130 0.5342 0.7130 0.8444
No log 22.6667 204 0.7272 0.5453 0.7272 0.8528
No log 22.8889 206 0.7376 0.5603 0.7376 0.8588
No log 23.1111 208 0.7644 0.5451 0.7644 0.8743
No log 23.3333 210 0.7904 0.5410 0.7904 0.8890
No log 23.5556 212 0.7836 0.5452 0.7836 0.8852
No log 23.7778 214 0.7870 0.5144 0.7870 0.8871
No log 24.0 216 0.8933 0.4724 0.8933 0.9452
No log 24.2222 218 0.9679 0.4828 0.9679 0.9838
No log 24.4444 220 0.9072 0.4934 0.9072 0.9525
No log 24.6667 222 0.7933 0.5375 0.7933 0.8907
No log 24.8889 224 0.7385 0.6017 0.7385 0.8594
No log 25.1111 226 0.7639 0.5226 0.7639 0.8740
No log 25.3333 228 0.7883 0.5216 0.7883 0.8879
No log 25.5556 230 0.7786 0.5410 0.7786 0.8824
No log 25.7778 232 0.7892 0.5288 0.7892 0.8883
No log 26.0 234 0.8223 0.5154 0.8223 0.9068
No log 26.2222 236 0.8153 0.5245 0.8153 0.9030
No log 26.4444 238 0.8026 0.5256 0.8026 0.8959
No log 26.6667 240 0.8042 0.5466 0.8042 0.8968
No log 26.8889 242 0.8032 0.5466 0.8032 0.8962
No log 27.1111 244 0.8050 0.5203 0.8050 0.8972
No log 27.3333 246 0.7998 0.5388 0.7998 0.8943
No log 27.5556 248 0.7923 0.5333 0.7923 0.8901
No log 27.7778 250 0.7708 0.5426 0.7708 0.8780
No log 28.0 252 0.7750 0.5535 0.7750 0.8803
No log 28.2222 254 0.7808 0.5535 0.7808 0.8837
No log 28.4444 256 0.7952 0.5388 0.7952 0.8917
No log 28.6667 258 0.7990 0.5324 0.7990 0.8939
No log 28.8889 260 0.7993 0.5126 0.7993 0.8940
No log 29.1111 262 0.8009 0.5058 0.8009 0.8949
No log 29.3333 264 0.7830 0.5104 0.7830 0.8849
No log 29.5556 266 0.7621 0.5551 0.7621 0.8730
No log 29.7778 268 0.7595 0.5873 0.7595 0.8715
No log 30.0 270 0.7483 0.5699 0.7483 0.8651
No log 30.2222 272 0.7368 0.5967 0.7368 0.8584
No log 30.4444 274 0.7597 0.5472 0.7597 0.8716
No log 30.6667 276 0.7895 0.5216 0.7895 0.8885
No log 30.8889 278 0.8051 0.5129 0.8051 0.8973
No log 31.1111 280 0.8247 0.4871 0.8247 0.9081
No log 31.3333 282 0.8171 0.5179 0.8171 0.9040
No log 31.5556 284 0.8160 0.5032 0.8160 0.9033
No log 31.7778 286 0.8315 0.5199 0.8315 0.9119
No log 32.0 288 0.8463 0.5073 0.8463 0.9199
No log 32.2222 290 0.8594 0.5169 0.8594 0.9270
No log 32.4444 292 0.8729 0.5197 0.8729 0.9343
No log 32.6667 294 0.8634 0.5085 0.8634 0.9292
No log 32.8889 296 0.8449 0.5231 0.8449 0.9192
No log 33.1111 298 0.8413 0.5251 0.8413 0.9172
No log 33.3333 300 0.8490 0.5309 0.8490 0.9214
No log 33.5556 302 0.8783 0.5103 0.8783 0.9372
No log 33.7778 304 0.8997 0.5084 0.8997 0.9485
No log 34.0 306 0.9022 0.4847 0.9022 0.9499
No log 34.2222 308 0.9052 0.5002 0.9052 0.9514
No log 34.4444 310 0.9466 0.4650 0.9466 0.9730
No log 34.6667 312 0.9573 0.4680 0.9573 0.9784
No log 34.8889 314 0.9003 0.4701 0.9003 0.9489
No log 35.1111 316 0.8388 0.5476 0.8388 0.9159
No log 35.3333 318 0.8320 0.5508 0.8320 0.9121
No log 35.5556 320 0.8302 0.5419 0.8302 0.9111
No log 35.7778 322 0.8304 0.5548 0.8304 0.9113
No log 36.0 324 0.8354 0.5328 0.8354 0.9140
No log 36.2222 326 0.8441 0.5427 0.8441 0.9187
No log 36.4444 328 0.8716 0.5038 0.8716 0.9336
No log 36.6667 330 0.8723 0.5002 0.8723 0.9340
No log 36.8889 332 0.8786 0.4880 0.8786 0.9373
No log 37.1111 334 0.8691 0.5150 0.8691 0.9323
No log 37.3333 336 0.8583 0.4985 0.8583 0.9264
No log 37.5556 338 0.8467 0.5240 0.8467 0.9202
No log 37.7778 340 0.8341 0.5162 0.8341 0.9133
No log 38.0 342 0.8408 0.5088 0.8408 0.9169
No log 38.2222 344 0.8376 0.5155 0.8376 0.9152
No log 38.4444 346 0.8122 0.5296 0.8122 0.9012
No log 38.6667 348 0.7967 0.5228 0.7967 0.8926
No log 38.8889 350 0.7914 0.5335 0.7914 0.8896
No log 39.1111 352 0.8021 0.4901 0.8021 0.8956
No log 39.3333 354 0.8363 0.4801 0.8363 0.9145
No log 39.5556 356 0.8381 0.4684 0.8381 0.9155
No log 39.7778 358 0.8201 0.5281 0.8201 0.9056
No log 40.0 360 0.8253 0.5400 0.8253 0.9085
No log 40.2222 362 0.8311 0.5439 0.8311 0.9116
No log 40.4444 364 0.8467 0.4959 0.8467 0.9202
No log 40.6667 366 0.9065 0.4671 0.9065 0.9521
No log 40.8889 368 0.9244 0.4750 0.9244 0.9615
No log 41.1111 370 0.8865 0.4671 0.8865 0.9416
No log 41.3333 372 0.8432 0.5176 0.8432 0.9183
No log 41.5556 374 0.8451 0.5206 0.8451 0.9193
No log 41.7778 376 0.8494 0.5029 0.8494 0.9216
No log 42.0 378 0.8599 0.4940 0.8599 0.9273
No log 42.2222 380 0.8791 0.4685 0.8791 0.9376
No log 42.4444 382 0.9012 0.4675 0.9012 0.9493
No log 42.6667 384 0.9107 0.4675 0.9107 0.9543
No log 42.8889 386 0.8773 0.4643 0.8773 0.9367
No log 43.1111 388 0.8592 0.5191 0.8592 0.9269
No log 43.3333 390 0.8716 0.4816 0.8716 0.9336
No log 43.5556 392 0.8810 0.4826 0.8810 0.9386
No log 43.7778 394 0.8813 0.4781 0.8813 0.9388
No log 44.0 396 0.8837 0.4835 0.8837 0.9400
No log 44.2222 398 0.8884 0.4684 0.8884 0.9426
No log 44.4444 400 0.8766 0.4835 0.8766 0.9363
No log 44.6667 402 0.8668 0.4688 0.8668 0.9310
No log 44.8889 404 0.8678 0.4601 0.8678 0.9316
No log 45.1111 406 0.8736 0.4482 0.8736 0.9347
No log 45.3333 408 0.8999 0.4647 0.8999 0.9486
No log 45.5556 410 0.9122 0.4548 0.9122 0.9551
No log 45.7778 412 0.8921 0.4399 0.8921 0.9445
No log 46.0 414 0.8743 0.4383 0.8743 0.9350
No log 46.2222 416 0.8626 0.4546 0.8626 0.9287
No log 46.4444 418 0.8537 0.5133 0.8537 0.9240
No log 46.6667 420 0.8426 0.5005 0.8426 0.9179
No log 46.8889 422 0.8347 0.4920 0.8347 0.9136
No log 47.1111 424 0.8350 0.4970 0.8350 0.9138
No log 47.3333 426 0.8461 0.4891 0.8461 0.9199
No log 47.5556 428 0.8585 0.4920 0.8585 0.9266
No log 47.7778 430 0.8657 0.4921 0.8657 0.9304
No log 48.0 432 0.8789 0.4540 0.8789 0.9375
No log 48.2222 434 0.8994 0.4569 0.8994 0.9484
No log 48.4444 436 0.9605 0.4735 0.9605 0.9800
No log 48.6667 438 0.9874 0.4735 0.9874 0.9937
No log 48.8889 440 0.9546 0.4671 0.9546 0.9770
No log 49.1111 442 0.9435 0.4349 0.9435 0.9714
No log 49.3333 444 0.9307 0.4494 0.9307 0.9647
No log 49.5556 446 0.9068 0.4642 0.9068 0.9523
No log 49.7778 448 0.8986 0.4737 0.8986 0.9479
No log 50.0 450 0.8950 0.4737 0.8950 0.9460
No log 50.2222 452 0.8873 0.4651 0.8873 0.9420
No log 50.4444 454 0.8743 0.4735 0.8743 0.9350
No log 50.6667 456 0.8660 0.4709 0.8660 0.9306
No log 50.8889 458 0.8744 0.4626 0.8744 0.9351
No log 51.1111 460 0.8755 0.4699 0.8755 0.9357
No log 51.3333 462 0.8662 0.4692 0.8662 0.9307
No log 51.5556 464 0.8773 0.4659 0.8773 0.9367
No log 51.7778 466 0.8924 0.4756 0.8924 0.9447
No log 52.0 468 0.9044 0.4613 0.9044 0.9510
No log 52.2222 470 0.9170 0.4622 0.9170 0.9576
No log 52.4444 472 0.9387 0.4406 0.9387 0.9689
No log 52.6667 474 0.9399 0.4406 0.9399 0.9695
No log 52.8889 476 0.9266 0.4582 0.9266 0.9626
No log 53.1111 478 0.9209 0.4745 0.9209 0.9596
No log 53.3333 480 0.9225 0.4976 0.9225 0.9605
No log 53.5556 482 0.9018 0.4768 0.9018 0.9496
No log 53.7778 484 0.8807 0.4735 0.8807 0.9385
No log 54.0 486 0.8683 0.4735 0.8683 0.9318
No log 54.2222 488 0.8643 0.4709 0.8643 0.9297
No log 54.4444 490 0.8578 0.4821 0.8578 0.9262
No log 54.6667 492 0.8511 0.4622 0.8511 0.9225
No log 54.8889 494 0.8559 0.4650 0.8559 0.9252
No log 55.1111 496 0.8617 0.4762 0.8617 0.9283
No log 55.3333 498 0.8712 0.4917 0.8712 0.9334
0.5185 55.5556 500 0.8745 0.4768 0.8745 0.9351
0.5185 55.7778 502 0.8731 0.4909 0.8731 0.9344
0.5185 56.0 504 0.8776 0.5015 0.8776 0.9368
0.5185 56.2222 506 0.8767 0.5008 0.8767 0.9363
0.5185 56.4444 508 0.8768 0.5008 0.8768 0.9364
0.5185 56.6667 510 0.8776 0.4747 0.8776 0.9368
0.5185 56.8889 512 0.8932 0.4414 0.8932 0.9451
0.5185 57.1111 514 0.9193 0.4607 0.9193 0.9588
0.5185 57.3333 516 0.9314 0.4603 0.9314 0.9651
0.5185 57.5556 518 0.9272 0.4300 0.9272 0.9629

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.1+cu121
  • Datasets 3.2.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits6_FineTuningAraBERTFreeze_run3_AugV5_k3_task2_organization

Finetuned
(4023)
this model