ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k4_task1_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.7465
  • Qwk: 0.5320
  • Mse: 1.7465
  • Rmse: 1.3216

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1111 2 6.8468 0.0242 6.8468 2.6166
No log 0.2222 4 4.6415 0.0549 4.6415 2.1544
No log 0.3333 6 3.6682 0.0 3.6682 1.9153
No log 0.4444 8 3.2306 -0.0312 3.2306 1.7974
No log 0.5556 10 2.6133 0.1655 2.6133 1.6166
No log 0.6667 12 2.3114 0.2270 2.3114 1.5203
No log 0.7778 14 2.1183 0.2143 2.1183 1.4554
No log 0.8889 16 2.5398 0.2086 2.5398 1.5937
No log 1.0 18 2.9574 0.2341 2.9574 1.7197
No log 1.1111 20 2.3815 0.3034 2.3815 1.5432
No log 1.2222 22 2.1068 0.3165 2.1068 1.4515
No log 1.3333 24 1.9698 0.2590 1.9698 1.4035
No log 1.4444 26 2.0894 0.3137 2.0894 1.4455
No log 1.5556 28 2.3093 0.3412 2.3093 1.5197
No log 1.6667 30 2.7019 0.2843 2.7019 1.6437
No log 1.7778 32 2.4037 0.2840 2.4037 1.5504
No log 1.8889 34 2.0115 0.3087 2.0115 1.4183
No log 2.0 36 1.2557 0.5156 1.2557 1.1206
No log 2.1111 38 1.3308 0.4262 1.3308 1.1536
No log 2.2222 40 1.3893 0.4298 1.3893 1.1787
No log 2.3333 42 1.2310 0.5556 1.2310 1.1095
No log 2.4444 44 1.3976 0.4088 1.3976 1.1822
No log 2.5556 46 2.5206 0.2955 2.5206 1.5876
No log 2.6667 48 2.9436 0.2617 2.9436 1.7157
No log 2.7778 50 2.6390 0.3220 2.6390 1.6245
No log 2.8889 52 2.2016 0.4420 2.2016 1.4838
No log 3.0 54 1.8901 0.4311 1.8901 1.3748
No log 3.1111 56 1.8146 0.4311 1.8146 1.3471
No log 3.2222 58 2.0441 0.4556 2.0441 1.4297
No log 3.3333 60 2.2144 0.4108 2.2144 1.4881
No log 3.4444 62 2.3137 0.4108 2.3137 1.5211
No log 3.5556 64 2.3185 0.4108 2.3185 1.5227
No log 3.6667 66 2.0726 0.3953 2.0726 1.4397
No log 3.7778 68 2.2708 0.4130 2.2708 1.5069
No log 3.8889 70 2.5912 0.3725 2.5912 1.6097
No log 4.0 72 2.2350 0.4149 2.2350 1.4950
No log 4.1111 74 1.9899 0.4541 1.9899 1.4106
No log 4.2222 76 2.4938 0.3846 2.4938 1.5792
No log 4.3333 78 3.2165 0.2532 3.2165 1.7935
No log 4.4444 80 3.5621 0.1893 3.5621 1.8873
No log 4.5556 82 2.6704 0.3791 2.6704 1.6341
No log 4.6667 84 1.4351 0.5486 1.4351 1.1980
No log 4.7778 86 1.2207 0.6405 1.2207 1.1049
No log 4.8889 88 1.2351 0.6040 1.2351 1.1114
No log 5.0 90 1.3823 0.5922 1.3823 1.1757
No log 5.1111 92 2.1924 0.4762 2.1924 1.4807
No log 5.2222 94 3.1046 0.3038 3.1046 1.7620
No log 5.3333 96 3.0995 0.3117 3.0995 1.7605
No log 5.4444 98 2.2369 0.4082 2.2369 1.4956
No log 5.5556 100 1.3396 0.5665 1.3396 1.1574
No log 5.6667 102 1.1234 0.6483 1.1234 1.0599
No log 5.7778 104 1.2048 0.5857 1.2048 1.0976
No log 5.8889 106 1.4243 0.475 1.4243 1.1934
No log 6.0 108 2.1812 0.4677 2.1812 1.4769
No log 6.1111 110 2.7292 0.3610 2.7292 1.6520
No log 6.2222 112 2.7245 0.3610 2.7245 1.6506
No log 6.3333 114 2.0748 0.4688 2.0748 1.4404
No log 6.4444 116 1.5849 0.5193 1.5849 1.2589
No log 6.5556 118 1.4408 0.6044 1.4408 1.2003
No log 6.6667 120 1.5632 0.5608 1.5632 1.2503
No log 6.7778 122 1.5534 0.5608 1.5534 1.2464
No log 6.8889 124 1.8452 0.5446 1.8452 1.3584
No log 7.0 126 1.7354 0.5463 1.7354 1.3173
No log 7.1111 128 1.6432 0.5616 1.6432 1.2819
No log 7.2222 130 1.6562 0.5616 1.6562 1.2869
No log 7.3333 132 1.4066 0.6073 1.4066 1.1860
No log 7.4444 134 1.2292 0.6145 1.2292 1.1087
No log 7.5556 136 1.2697 0.5870 1.2697 1.1268
No log 7.6667 138 1.7136 0.5616 1.7136 1.3090
No log 7.7778 140 1.8457 0.5320 1.8457 1.3586
No log 7.8889 142 1.5288 0.5714 1.5288 1.2364
No log 8.0 144 1.2730 0.5509 1.2730 1.1283
No log 8.1111 146 1.2561 0.5789 1.2561 1.1208
No log 8.2222 148 1.4045 0.4512 1.4045 1.1851
No log 8.3333 150 1.8000 0.5134 1.8000 1.3416
No log 8.4444 152 2.1627 0.4757 2.1627 1.4706
No log 8.5556 154 2.0124 0.4848 2.0124 1.4186
No log 8.6667 156 1.4748 0.5119 1.4748 1.2144
No log 8.7778 158 1.1949 0.6069 1.1949 1.0931
No log 8.8889 160 1.2035 0.6164 1.2035 1.0970
No log 9.0 162 1.2356 0.6069 1.2356 1.1116
No log 9.1111 164 1.2836 0.5443 1.2836 1.1329
No log 9.2222 166 1.6598 0.5087 1.6598 1.2883
No log 9.3333 168 2.5058 0.3779 2.5058 1.5830
No log 9.4444 170 2.8236 0.375 2.8236 1.6804
No log 9.5556 172 1.9768 0.4876 1.9768 1.4060
No log 9.6667 174 1.4904 0.5895 1.4904 1.2208
No log 9.7778 176 1.2607 0.6145 1.2607 1.1228
No log 9.8889 178 1.2242 0.6514 1.2242 1.1064
No log 10.0 180 1.0401 0.7018 1.0401 1.0199
No log 10.1111 182 0.9581 0.6707 0.9581 0.9788
No log 10.2222 184 1.0033 0.6786 1.0033 1.0017
No log 10.3333 186 1.2455 0.6304 1.2455 1.1160
No log 10.4444 188 1.8818 0.5050 1.8818 1.3718
No log 10.5556 190 2.2600 0.4608 2.2600 1.5033
No log 10.6667 192 1.9176 0.5192 1.9176 1.3848
No log 10.7778 194 1.4391 0.5895 1.4391 1.1996
No log 10.8889 196 1.1199 0.6744 1.1199 1.0583
No log 11.0 198 1.1502 0.6628 1.1502 1.0725
No log 11.1111 200 1.5282 0.5561 1.5282 1.2362
No log 11.2222 202 1.9516 0.4831 1.9516 1.3970
No log 11.3333 204 1.8868 0.49 1.8868 1.3736
No log 11.4444 206 1.7993 0.4845 1.7993 1.3414
No log 11.5556 208 1.6976 0.5340 1.6976 1.3029
No log 11.6667 210 1.6173 0.5864 1.6173 1.2717
No log 11.7778 212 1.9411 0.5050 1.9411 1.3932
No log 11.8889 214 1.7269 0.5492 1.7269 1.3141
No log 12.0 216 1.5344 0.5492 1.5344 1.2387
No log 12.1111 218 1.3552 0.6044 1.3552 1.1641
No log 12.2222 220 1.2048 0.6441 1.2048 1.0976
No log 12.3333 222 1.3088 0.6180 1.3088 1.1440
No log 12.4444 224 1.5446 0.5397 1.5446 1.2428
No log 12.5556 226 1.5014 0.5455 1.5014 1.2253
No log 12.6667 228 1.4093 0.5946 1.4093 1.1872
No log 12.7778 230 1.1609 0.6358 1.1609 1.0775
No log 12.8889 232 1.0558 0.7045 1.0558 1.0275
No log 13.0 234 1.1436 0.6740 1.1436 1.0694
No log 13.1111 236 1.4420 0.5381 1.4420 1.2008
No log 13.2222 238 1.6156 0.5455 1.6156 1.2711
No log 13.3333 240 1.3721 0.6425 1.3721 1.1714
No log 13.4444 242 1.4088 0.6425 1.4088 1.1869
No log 13.5556 244 1.4643 0.6114 1.4643 1.2101
No log 13.6667 246 1.4172 0.6096 1.4172 1.1905
No log 13.7778 248 1.4363 0.5806 1.4363 1.1985
No log 13.8889 250 1.3989 0.5856 1.3989 1.1827
No log 14.0 252 1.3412 0.6207 1.3412 1.1581
No log 14.1111 254 1.2783 0.6316 1.2783 1.1306
No log 14.2222 256 1.2059 0.6265 1.2059 1.0981
No log 14.3333 258 1.2522 0.6092 1.2522 1.1190
No log 14.4444 260 1.3911 0.6154 1.3911 1.1794
No log 14.5556 262 1.5137 0.5670 1.5137 1.2303
No log 14.6667 264 1.5682 0.5436 1.5682 1.2523
No log 14.7778 266 1.4670 0.5744 1.4670 1.2112
No log 14.8889 268 1.3378 0.5978 1.3378 1.1566
No log 15.0 270 1.5144 0.5436 1.5144 1.2306
No log 15.1111 272 1.7477 0.5373 1.7477 1.3220
No log 15.2222 274 1.8266 0.5411 1.8266 1.3515
No log 15.3333 276 1.5722 0.5612 1.5722 1.2539
No log 15.4444 278 1.3750 0.5926 1.3750 1.1726
No log 15.5556 280 1.1552 0.6296 1.1552 1.0748
No log 15.6667 282 1.0238 0.6358 1.0238 1.0118
No log 15.7778 284 1.0123 0.6579 1.0123 1.0061
No log 15.8889 286 1.1305 0.6386 1.1305 1.0633
No log 16.0 288 1.5801 0.5381 1.5801 1.2570
No log 16.1111 290 1.7438 0.53 1.7438 1.3205
No log 16.2222 292 1.8524 0.5373 1.8524 1.3610
No log 16.3333 294 1.9904 0.5263 1.9904 1.4108
No log 16.4444 296 1.8322 0.5373 1.8322 1.3536
No log 16.5556 298 1.7012 0.5185 1.7012 1.3043
No log 16.6667 300 1.5643 0.5319 1.5643 1.2507
No log 16.7778 302 1.4406 0.5333 1.4406 1.2002
No log 16.8889 304 1.4027 0.5089 1.4027 1.1843
No log 17.0 306 1.5883 0.5161 1.5883 1.2603
No log 17.1111 308 1.6583 0.5185 1.6583 1.2878
No log 17.2222 310 1.4151 0.5484 1.4151 1.1896
No log 17.3333 312 1.2076 0.6628 1.2076 1.0989
No log 17.4444 314 1.1659 0.6629 1.1659 1.0798
No log 17.5556 316 1.1819 0.6667 1.1819 1.0871
No log 17.6667 318 1.1526 0.6740 1.1526 1.0736
No log 17.7778 320 1.1659 0.6593 1.1659 1.0798
No log 17.8889 322 1.1980 0.6596 1.1980 1.0945
No log 18.0 324 1.3602 0.6146 1.3602 1.1663
No log 18.1111 326 1.6318 0.5592 1.6318 1.2774
No log 18.2222 328 1.7141 0.5472 1.7141 1.3092
No log 18.3333 330 1.4330 0.5907 1.4330 1.1971
No log 18.4444 332 1.1987 0.6102 1.1987 1.0949
No log 18.5556 334 1.1068 0.6514 1.1068 1.0521
No log 18.6667 336 1.2330 0.6102 1.2330 1.1104
No log 18.7778 338 1.4022 0.6111 1.4022 1.1841
No log 18.8889 340 1.3913 0.5909 1.3913 1.1795
No log 19.0 342 1.5235 0.5667 1.5235 1.2343
No log 19.1111 344 1.3536 0.6067 1.3536 1.1634
No log 19.2222 346 1.1047 0.6190 1.1047 1.0511
No log 19.3333 348 0.9443 0.6795 0.9443 0.9717
No log 19.4444 350 0.9406 0.6667 0.9406 0.9699
No log 19.5556 352 1.0394 0.6590 1.0394 1.0195
No log 19.6667 354 1.2997 0.5967 1.2997 1.1400
No log 19.7778 356 1.4513 0.5914 1.4513 1.2047
No log 19.8889 358 1.3927 0.6011 1.3927 1.1801
No log 20.0 360 1.3212 0.6145 1.3212 1.1494
No log 20.1111 362 1.1635 0.6067 1.1635 1.0787
No log 20.2222 364 1.1769 0.6188 1.1769 1.0849
No log 20.3333 366 1.3793 0.5746 1.3793 1.1744
No log 20.4444 368 1.4686 0.5746 1.4686 1.2118
No log 20.5556 370 1.3330 0.5747 1.3330 1.1546
No log 20.6667 372 1.1547 0.6076 1.1547 1.0745
No log 20.7778 374 0.9797 0.6069 0.9797 0.9898
No log 20.8889 376 0.9288 0.6479 0.9288 0.9637
No log 21.0 378 0.9880 0.6164 0.9880 0.9940
No log 21.1111 380 1.1918 0.5988 1.1918 1.0917
No log 21.2222 382 1.5025 0.5682 1.5025 1.2258
No log 21.3333 384 1.6082 0.5761 1.6082 1.2681
No log 21.4444 386 1.5796 0.5838 1.5796 1.2568
No log 21.5556 388 1.5846 0.5838 1.5846 1.2588
No log 21.6667 390 1.5492 0.5838 1.5492 1.2447
No log 21.7778 392 1.4526 0.5761 1.4526 1.2053
No log 21.8889 394 1.3504 0.5946 1.3504 1.1621
No log 22.0 396 1.4497 0.5761 1.4497 1.2041
No log 22.1111 398 1.5921 0.5455 1.5921 1.2618
No log 22.2222 400 1.4697 0.5761 1.4697 1.2123
No log 22.3333 402 1.3288 0.6292 1.3288 1.1527
No log 22.4444 404 1.1724 0.5952 1.1724 1.0828
No log 22.5556 406 1.1693 0.6061 1.1693 1.0813
No log 22.6667 408 1.3036 0.625 1.3036 1.1418
No log 22.7778 410 1.4612 0.5761 1.4612 1.2088
No log 22.8889 412 1.5585 0.5833 1.5585 1.2484
No log 23.0 414 1.3760 0.5870 1.3760 1.1730
No log 23.1111 416 1.1576 0.6180 1.1576 1.0759
No log 23.2222 418 1.0295 0.6587 1.0295 1.0146
No log 23.3333 420 1.0303 0.6543 1.0303 1.0151
No log 23.4444 422 1.1432 0.6395 1.1432 1.0692
No log 23.5556 424 1.5041 0.6129 1.5041 1.2264
No log 23.6667 426 1.9284 0.5217 1.9284 1.3887
No log 23.7778 428 2.2810 0.4340 2.2810 1.5103
No log 23.8889 430 2.1676 0.4593 2.1676 1.4723
No log 24.0 432 1.7406 0.5258 1.7406 1.3193
No log 24.1111 434 1.3340 0.5763 1.3340 1.1550
No log 24.2222 436 1.1398 0.6087 1.1398 1.0676
No log 24.3333 438 1.0984 0.6164 1.0984 1.0481
No log 24.4444 440 1.1596 0.6303 1.1596 1.0768
No log 24.5556 442 1.4142 0.6222 1.4142 1.1892
No log 24.6667 444 1.7212 0.5106 1.7212 1.3120
No log 24.7778 446 1.8292 0.4086 1.8292 1.3525
No log 24.8889 448 1.5978 0.5217 1.5978 1.2640
No log 25.0 450 1.4470 0.5746 1.4470 1.2029
No log 25.1111 452 1.4166 0.5810 1.4166 1.1902
No log 25.2222 454 1.3359 0.6102 1.3359 1.1558
No log 25.3333 456 1.2434 0.6163 1.2434 1.1151
No log 25.4444 458 1.2449 0.6 1.2449 1.1157
No log 25.5556 460 1.3956 0.6067 1.3956 1.1814
No log 25.6667 462 1.5761 0.5789 1.5761 1.2554
No log 25.7778 464 1.8954 0.4948 1.8954 1.3767
No log 25.8889 466 1.9957 0.4615 1.9957 1.4127
No log 26.0 468 1.8390 0.4737 1.8390 1.3561
No log 26.1111 470 1.4116 0.6102 1.4116 1.1881
No log 26.2222 472 1.0842 0.6275 1.0842 1.0413
No log 26.3333 474 1.0009 0.6405 1.0009 1.0004
No log 26.4444 476 0.9795 0.6494 0.9795 0.9897
No log 26.5556 478 1.0297 0.65 1.0297 1.0147
No log 26.6667 480 1.1898 0.6108 1.1898 1.0908
No log 26.7778 482 1.4596 0.5683 1.4596 1.2081
No log 26.8889 484 1.6103 0.5561 1.6103 1.2690
No log 27.0 486 1.5411 0.5622 1.5411 1.2414
No log 27.1111 488 1.3088 0.5870 1.3088 1.1440
No log 27.2222 490 1.1308 0.6480 1.1308 1.0634
No log 27.3333 492 1.1116 0.6480 1.1116 1.0543
No log 27.4444 494 1.1553 0.6667 1.1553 1.0749
No log 27.5556 496 1.1153 0.6628 1.1153 1.0561
No log 27.6667 498 1.0143 0.6460 1.0143 1.0071
0.3702 27.7778 500 0.9431 0.6234 0.9431 0.9711
0.3702 27.8889 502 0.9276 0.6154 0.9276 0.9631
0.3702 28.0 504 0.9448 0.6014 0.9448 0.9720
0.3702 28.1111 506 1.0254 0.6792 1.0254 1.0126
0.3702 28.2222 508 1.2043 0.6509 1.2043 1.0974
0.3702 28.3333 510 1.4090 0.5652 1.4090 1.1870
0.3702 28.4444 512 1.5250 0.5668 1.5250 1.2349
0.3702 28.5556 514 1.4305 0.5989 1.4305 1.1960
0.3702 28.6667 516 1.2862 0.5989 1.2862 1.1341
0.3702 28.7778 518 1.1436 0.6480 1.1436 1.0694
0.3702 28.8889 520 1.1807 0.6333 1.1807 1.0866
0.3702 29.0 522 1.2625 0.6054 1.2625 1.1236
0.3702 29.1111 524 1.3500 0.5870 1.3500 1.1619
0.3702 29.2222 526 1.4283 0.5870 1.4283 1.1951
0.3702 29.3333 528 1.6601 0.5528 1.6601 1.2884
0.3702 29.4444 530 1.7465 0.5320 1.7465 1.3216

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MayBashendy/ArabicNewSplits7_B_usingALLEssays_FineTuningAraBERT_run2_AugV5_k4_task1_organization

Finetuned
(4019)
this model