ArabicNewSplits4_WithDuplicationsForScore5_FineTuningAraBERT_run3_AugV5_k13_task5_organization
This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:
- Loss: 0.9344
- Qwk: 0.6922
- Mse: 0.9344
- Rmse: 0.9666
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
Training results
| Training Loss | Epoch | Step | Validation Loss | Qwk | Mse | Rmse |
|---|---|---|---|---|---|---|
| No log | 0.0455 | 2 | 2.1680 | 0.0156 | 2.1680 | 1.4724 |
| No log | 0.0909 | 4 | 1.6328 | 0.0759 | 1.6328 | 1.2778 |
| No log | 0.1364 | 6 | 1.7104 | 0.1495 | 1.7104 | 1.3078 |
| No log | 0.1818 | 8 | 1.8170 | 0.1722 | 1.8170 | 1.3480 |
| No log | 0.2273 | 10 | 1.8184 | 0.2171 | 1.8184 | 1.3485 |
| No log | 0.2727 | 12 | 1.5745 | 0.2915 | 1.5745 | 1.2548 |
| No log | 0.3182 | 14 | 1.3895 | 0.2868 | 1.3895 | 1.1788 |
| No log | 0.3636 | 16 | 1.4465 | 0.3895 | 1.4465 | 1.2027 |
| No log | 0.4091 | 18 | 1.7245 | 0.2592 | 1.7245 | 1.3132 |
| No log | 0.4545 | 20 | 1.7516 | 0.2823 | 1.7516 | 1.3235 |
| No log | 0.5 | 22 | 1.7216 | 0.2741 | 1.7216 | 1.3121 |
| No log | 0.5455 | 24 | 1.7452 | 0.3527 | 1.7452 | 1.3211 |
| No log | 0.5909 | 26 | 1.8904 | 0.2585 | 1.8904 | 1.3749 |
| No log | 0.6364 | 28 | 1.9483 | 0.2440 | 1.9483 | 1.3958 |
| No log | 0.6818 | 30 | 1.8016 | 0.3158 | 1.8016 | 1.3423 |
| No log | 0.7273 | 32 | 1.6147 | 0.3312 | 1.6147 | 1.2707 |
| No log | 0.7727 | 34 | 1.6195 | 0.3318 | 1.6195 | 1.2726 |
| No log | 0.8182 | 36 | 1.7093 | 0.3179 | 1.7093 | 1.3074 |
| No log | 0.8636 | 38 | 2.0448 | 0.3231 | 2.0448 | 1.4300 |
| No log | 0.9091 | 40 | 2.4748 | 0.3008 | 2.4748 | 1.5732 |
| No log | 0.9545 | 42 | 2.5004 | 0.2930 | 2.5004 | 1.5813 |
| No log | 1.0 | 44 | 2.6207 | 0.3084 | 2.6207 | 1.6189 |
| No log | 1.0455 | 46 | 2.3959 | 0.3450 | 2.3959 | 1.5479 |
| No log | 1.0909 | 48 | 1.9170 | 0.3512 | 1.9170 | 1.3845 |
| No log | 1.1364 | 50 | 1.6163 | 0.4136 | 1.6163 | 1.2713 |
| No log | 1.1818 | 52 | 1.4347 | 0.4604 | 1.4347 | 1.1978 |
| No log | 1.2273 | 54 | 1.3761 | 0.4604 | 1.3761 | 1.1731 |
| No log | 1.2727 | 56 | 1.4589 | 0.4397 | 1.4589 | 1.2078 |
| No log | 1.3182 | 58 | 1.6848 | 0.3992 | 1.6848 | 1.2980 |
| No log | 1.3636 | 60 | 1.8731 | 0.3906 | 1.8731 | 1.3686 |
| No log | 1.4091 | 62 | 2.0583 | 0.3542 | 2.0583 | 1.4347 |
| No log | 1.4545 | 64 | 2.2413 | 0.3484 | 2.2413 | 1.4971 |
| No log | 1.5 | 66 | 2.1829 | 0.3439 | 2.1829 | 1.4775 |
| No log | 1.5455 | 68 | 2.0682 | 0.3909 | 2.0682 | 1.4381 |
| No log | 1.5909 | 70 | 2.0189 | 0.4207 | 2.0189 | 1.4209 |
| No log | 1.6364 | 72 | 1.9722 | 0.4055 | 1.9722 | 1.4044 |
| No log | 1.6818 | 74 | 1.8348 | 0.4221 | 1.8348 | 1.3545 |
| No log | 1.7273 | 76 | 1.7451 | 0.4499 | 1.7451 | 1.3210 |
| No log | 1.7727 | 78 | 1.6437 | 0.4535 | 1.6437 | 1.2821 |
| No log | 1.8182 | 80 | 1.5829 | 0.4680 | 1.5829 | 1.2581 |
| No log | 1.8636 | 82 | 1.6915 | 0.5012 | 1.6915 | 1.3006 |
| No log | 1.9091 | 84 | 1.5738 | 0.5111 | 1.5738 | 1.2545 |
| No log | 1.9545 | 86 | 1.3352 | 0.6001 | 1.3352 | 1.1555 |
| No log | 2.0 | 88 | 1.1906 | 0.6172 | 1.1906 | 1.0911 |
| No log | 2.0455 | 90 | 1.0371 | 0.6507 | 1.0371 | 1.0184 |
| No log | 2.0909 | 92 | 0.8948 | 0.6661 | 0.8948 | 0.9459 |
| No log | 2.1364 | 94 | 0.9398 | 0.6516 | 0.9398 | 0.9694 |
| No log | 2.1818 | 96 | 1.0247 | 0.6694 | 1.0247 | 1.0123 |
| No log | 2.2273 | 98 | 1.2178 | 0.6540 | 1.2178 | 1.1035 |
| No log | 2.2727 | 100 | 1.2186 | 0.6599 | 1.2186 | 1.1039 |
| No log | 2.3182 | 102 | 1.0245 | 0.6610 | 1.0245 | 1.0122 |
| No log | 2.3636 | 104 | 0.7633 | 0.6957 | 0.7633 | 0.8737 |
| No log | 2.4091 | 106 | 0.6927 | 0.6924 | 0.6927 | 0.8323 |
| No log | 2.4545 | 108 | 0.7726 | 0.6893 | 0.7726 | 0.8790 |
| No log | 2.5 | 110 | 0.9002 | 0.6839 | 0.9002 | 0.9488 |
| No log | 2.5455 | 112 | 1.1229 | 0.6547 | 1.1229 | 1.0597 |
| No log | 2.5909 | 114 | 1.2302 | 0.6269 | 1.2302 | 1.1091 |
| No log | 2.6364 | 116 | 1.0797 | 0.6601 | 1.0797 | 1.0391 |
| No log | 2.6818 | 118 | 0.8568 | 0.6638 | 0.8568 | 0.9256 |
| No log | 2.7273 | 120 | 0.7265 | 0.6875 | 0.7265 | 0.8523 |
| No log | 2.7727 | 122 | 0.7481 | 0.6707 | 0.7481 | 0.8649 |
| No log | 2.8182 | 124 | 0.8037 | 0.6582 | 0.8037 | 0.8965 |
| No log | 2.8636 | 126 | 0.9754 | 0.6382 | 0.9754 | 0.9876 |
| No log | 2.9091 | 128 | 1.1847 | 0.5768 | 1.1847 | 1.0885 |
| No log | 2.9545 | 130 | 1.3298 | 0.5298 | 1.3298 | 1.1532 |
| No log | 3.0 | 132 | 1.4823 | 0.5221 | 1.4823 | 1.2175 |
| No log | 3.0455 | 134 | 1.7293 | 0.5208 | 1.7293 | 1.3150 |
| No log | 3.0909 | 136 | 1.6952 | 0.5265 | 1.6952 | 1.3020 |
| No log | 3.1364 | 138 | 1.3039 | 0.5930 | 1.3039 | 1.1419 |
| No log | 3.1818 | 140 | 0.9400 | 0.6564 | 0.9400 | 0.9696 |
| No log | 3.2273 | 142 | 0.8238 | 0.6293 | 0.8238 | 0.9077 |
| No log | 3.2727 | 144 | 0.8374 | 0.6188 | 0.8374 | 0.9151 |
| No log | 3.3182 | 146 | 0.9445 | 0.6547 | 0.9445 | 0.9719 |
| No log | 3.3636 | 148 | 1.1680 | 0.5956 | 1.1680 | 1.0808 |
| No log | 3.4091 | 150 | 1.1466 | 0.6054 | 1.1466 | 1.0708 |
| No log | 3.4545 | 152 | 0.9797 | 0.6248 | 0.9797 | 0.9898 |
| No log | 3.5 | 154 | 1.0078 | 0.6277 | 1.0078 | 1.0039 |
| No log | 3.5455 | 156 | 1.0595 | 0.6249 | 1.0595 | 1.0293 |
| No log | 3.5909 | 158 | 1.1173 | 0.6418 | 1.1173 | 1.0570 |
| No log | 3.6364 | 160 | 1.1570 | 0.6577 | 1.1570 | 1.0756 |
| No log | 3.6818 | 162 | 1.2352 | 0.6212 | 1.2352 | 1.1114 |
| No log | 3.7273 | 164 | 1.1865 | 0.6484 | 1.1865 | 1.0893 |
| No log | 3.7727 | 166 | 1.0812 | 0.6584 | 1.0812 | 1.0398 |
| No log | 3.8182 | 168 | 0.8832 | 0.6944 | 0.8832 | 0.9398 |
| No log | 3.8636 | 170 | 0.7717 | 0.6890 | 0.7717 | 0.8785 |
| No log | 3.9091 | 172 | 0.8035 | 0.7072 | 0.8035 | 0.8964 |
| No log | 3.9545 | 174 | 0.8922 | 0.6914 | 0.8922 | 0.9445 |
| No log | 4.0 | 176 | 0.9968 | 0.6597 | 0.9968 | 0.9984 |
| No log | 4.0455 | 178 | 1.1357 | 0.6426 | 1.1357 | 1.0657 |
| No log | 4.0909 | 180 | 1.0741 | 0.6207 | 1.0741 | 1.0364 |
| No log | 4.1364 | 182 | 1.0084 | 0.6176 | 1.0084 | 1.0042 |
| No log | 4.1818 | 184 | 0.9837 | 0.6387 | 0.9837 | 0.9918 |
| No log | 4.2273 | 186 | 0.9254 | 0.6511 | 0.9254 | 0.9620 |
| No log | 4.2727 | 188 | 0.9646 | 0.6268 | 0.9646 | 0.9821 |
| No log | 4.3182 | 190 | 1.0571 | 0.5935 | 1.0571 | 1.0282 |
| No log | 4.3636 | 192 | 1.0901 | 0.5923 | 1.0901 | 1.0441 |
| No log | 4.4091 | 194 | 1.1256 | 0.5923 | 1.1256 | 1.0609 |
| No log | 4.4545 | 196 | 1.1697 | 0.6180 | 1.1697 | 1.0815 |
| No log | 4.5 | 198 | 1.1849 | 0.6252 | 1.1849 | 1.0885 |
| No log | 4.5455 | 200 | 1.1899 | 0.6252 | 1.1899 | 1.0908 |
| No log | 4.5909 | 202 | 1.1690 | 0.6290 | 1.1690 | 1.0812 |
| No log | 4.6364 | 204 | 1.1925 | 0.6290 | 1.1925 | 1.0920 |
| No log | 4.6818 | 206 | 1.0665 | 0.6106 | 1.0665 | 1.0327 |
| No log | 4.7273 | 208 | 0.9398 | 0.6425 | 0.9398 | 0.9694 |
| No log | 4.7727 | 210 | 0.8336 | 0.6502 | 0.8336 | 0.9130 |
| No log | 4.8182 | 212 | 0.8408 | 0.6582 | 0.8408 | 0.9170 |
| No log | 4.8636 | 214 | 0.9295 | 0.6525 | 0.9295 | 0.9641 |
| No log | 4.9091 | 216 | 0.9985 | 0.6673 | 0.9985 | 0.9993 |
| No log | 4.9545 | 218 | 1.1455 | 0.6465 | 1.1455 | 1.0703 |
| No log | 5.0 | 220 | 1.2507 | 0.6187 | 1.2507 | 1.1183 |
| No log | 5.0455 | 222 | 1.3115 | 0.5845 | 1.3115 | 1.1452 |
| No log | 5.0909 | 224 | 1.3094 | 0.5550 | 1.3094 | 1.1443 |
| No log | 5.1364 | 226 | 1.2638 | 0.5479 | 1.2638 | 1.1242 |
| No log | 5.1818 | 228 | 1.3529 | 0.5427 | 1.3529 | 1.1631 |
| No log | 5.2273 | 230 | 1.4165 | 0.5385 | 1.4165 | 1.1902 |
| No log | 5.2727 | 232 | 1.4201 | 0.5370 | 1.4201 | 1.1917 |
| No log | 5.3182 | 234 | 1.3216 | 0.5485 | 1.3216 | 1.1496 |
| No log | 5.3636 | 236 | 1.3148 | 0.5509 | 1.3148 | 1.1466 |
| No log | 5.4091 | 238 | 1.3565 | 0.5485 | 1.3565 | 1.1647 |
| No log | 5.4545 | 240 | 1.4511 | 0.5377 | 1.4511 | 1.2046 |
| No log | 5.5 | 242 | 1.4959 | 0.5519 | 1.4959 | 1.2231 |
| No log | 5.5455 | 244 | 1.4482 | 0.5526 | 1.4482 | 1.2034 |
| No log | 5.5909 | 246 | 1.2735 | 0.5655 | 1.2735 | 1.1285 |
| No log | 5.6364 | 248 | 1.0797 | 0.6282 | 1.0797 | 1.0391 |
| No log | 5.6818 | 250 | 0.9370 | 0.6512 | 0.9370 | 0.9680 |
| No log | 5.7273 | 252 | 0.9308 | 0.6351 | 0.9308 | 0.9648 |
| No log | 5.7727 | 254 | 0.9845 | 0.6479 | 0.9845 | 0.9922 |
| No log | 5.8182 | 256 | 1.0533 | 0.6426 | 1.0533 | 1.0263 |
| No log | 5.8636 | 258 | 1.0613 | 0.6426 | 1.0613 | 1.0302 |
| No log | 5.9091 | 260 | 0.9605 | 0.6456 | 0.9605 | 0.9800 |
| No log | 5.9545 | 262 | 0.8627 | 0.6651 | 0.8627 | 0.9288 |
| No log | 6.0 | 264 | 0.8373 | 0.6322 | 0.8373 | 0.9150 |
| No log | 6.0455 | 266 | 0.9099 | 0.6256 | 0.9099 | 0.9539 |
| No log | 6.0909 | 268 | 1.0063 | 0.6353 | 1.0063 | 1.0031 |
| No log | 6.1364 | 270 | 1.0420 | 0.6391 | 1.0420 | 1.0208 |
| No log | 6.1818 | 272 | 1.0156 | 0.6560 | 1.0156 | 1.0077 |
| No log | 6.2273 | 274 | 0.9735 | 0.6789 | 0.9735 | 0.9867 |
| No log | 6.2727 | 276 | 0.9509 | 0.6755 | 0.9509 | 0.9752 |
| No log | 6.3182 | 278 | 0.9839 | 0.6816 | 0.9839 | 0.9919 |
| No log | 6.3636 | 280 | 1.1088 | 0.6488 | 1.1088 | 1.0530 |
| No log | 6.4091 | 282 | 1.2706 | 0.5922 | 1.2706 | 1.1272 |
| No log | 6.4545 | 284 | 1.4608 | 0.5370 | 1.4608 | 1.2086 |
| No log | 6.5 | 286 | 1.5792 | 0.5335 | 1.5792 | 1.2567 |
| No log | 6.5455 | 288 | 1.5179 | 0.5270 | 1.5179 | 1.2320 |
| No log | 6.5909 | 290 | 1.3714 | 0.5621 | 1.3714 | 1.1711 |
| No log | 6.6364 | 292 | 1.2286 | 0.5879 | 1.2286 | 1.1084 |
| No log | 6.6818 | 294 | 1.1121 | 0.6478 | 1.1121 | 1.0546 |
| No log | 6.7273 | 296 | 1.0287 | 0.6650 | 1.0287 | 1.0142 |
| No log | 6.7727 | 298 | 1.0005 | 0.6714 | 1.0005 | 1.0002 |
| No log | 6.8182 | 300 | 0.9815 | 0.6864 | 0.9815 | 0.9907 |
| No log | 6.8636 | 302 | 0.9954 | 0.6633 | 0.9954 | 0.9977 |
| No log | 6.9091 | 304 | 0.9607 | 0.6912 | 0.9607 | 0.9802 |
| No log | 6.9545 | 306 | 0.9705 | 0.6844 | 0.9705 | 0.9851 |
| No log | 7.0 | 308 | 0.9995 | 0.6712 | 0.9995 | 0.9998 |
| No log | 7.0455 | 310 | 0.9887 | 0.6867 | 0.9887 | 0.9943 |
| No log | 7.0909 | 312 | 1.0008 | 0.6861 | 1.0008 | 1.0004 |
| No log | 7.1364 | 314 | 1.0351 | 0.6444 | 1.0351 | 1.0174 |
| No log | 7.1818 | 316 | 1.0076 | 0.6826 | 1.0076 | 1.0038 |
| No log | 7.2273 | 318 | 0.9409 | 0.6928 | 0.9409 | 0.9700 |
| No log | 7.2727 | 320 | 0.8692 | 0.6797 | 0.8692 | 0.9323 |
| No log | 7.3182 | 322 | 0.8206 | 0.7019 | 0.8206 | 0.9058 |
| No log | 7.3636 | 324 | 0.8371 | 0.7209 | 0.8371 | 0.9149 |
| No log | 7.4091 | 326 | 0.9219 | 0.7076 | 0.9219 | 0.9601 |
| No log | 7.4545 | 328 | 1.0373 | 0.6636 | 1.0373 | 1.0185 |
| No log | 7.5 | 330 | 1.0815 | 0.6621 | 1.0815 | 1.0400 |
| No log | 7.5455 | 332 | 1.1312 | 0.6492 | 1.1312 | 1.0636 |
| No log | 7.5909 | 334 | 1.1118 | 0.6492 | 1.1118 | 1.0544 |
| No log | 7.6364 | 336 | 1.0319 | 0.6800 | 1.0319 | 1.0158 |
| No log | 7.6818 | 338 | 0.9728 | 0.7033 | 0.9728 | 0.9863 |
| No log | 7.7273 | 340 | 0.9599 | 0.7169 | 0.9599 | 0.9797 |
| No log | 7.7727 | 342 | 0.9822 | 0.6947 | 0.9822 | 0.9910 |
| No log | 7.8182 | 344 | 0.9957 | 0.6947 | 0.9957 | 0.9978 |
| No log | 7.8636 | 346 | 0.9605 | 0.7169 | 0.9605 | 0.9801 |
| No log | 7.9091 | 348 | 0.9365 | 0.7176 | 0.9365 | 0.9677 |
| No log | 7.9545 | 350 | 0.9178 | 0.7182 | 0.9178 | 0.9580 |
| No log | 8.0 | 352 | 0.8933 | 0.7182 | 0.8933 | 0.9451 |
| No log | 8.0455 | 354 | 0.9109 | 0.7182 | 0.9109 | 0.9544 |
| No log | 8.0909 | 356 | 0.9569 | 0.7169 | 0.9569 | 0.9782 |
| No log | 8.1364 | 358 | 0.9569 | 0.7169 | 0.9569 | 0.9782 |
| No log | 8.1818 | 360 | 0.9276 | 0.7169 | 0.9276 | 0.9631 |
| No log | 8.2273 | 362 | 0.9107 | 0.7168 | 0.9107 | 0.9543 |
| No log | 8.2727 | 364 | 0.8754 | 0.7136 | 0.8754 | 0.9356 |
| No log | 8.3182 | 366 | 0.8455 | 0.7098 | 0.8455 | 0.9195 |
| No log | 8.3636 | 368 | 0.8407 | 0.7098 | 0.8407 | 0.9169 |
| No log | 8.4091 | 370 | 0.8605 | 0.7131 | 0.8605 | 0.9276 |
| No log | 8.4545 | 372 | 0.8996 | 0.7136 | 0.8996 | 0.9485 |
| No log | 8.5 | 374 | 0.9543 | 0.7058 | 0.9543 | 0.9769 |
| No log | 8.5455 | 376 | 1.0008 | 0.6774 | 1.0008 | 1.0004 |
| No log | 8.5909 | 378 | 1.0059 | 0.6774 | 1.0059 | 1.0029 |
| No log | 8.6364 | 380 | 0.9801 | 0.6998 | 0.9801 | 0.9900 |
| No log | 8.6818 | 382 | 0.9324 | 0.6998 | 0.9324 | 0.9656 |
| No log | 8.7273 | 384 | 0.8792 | 0.6811 | 0.8792 | 0.9377 |
| No log | 8.7727 | 386 | 0.8405 | 0.7059 | 0.8405 | 0.9168 |
| No log | 8.8182 | 388 | 0.8344 | 0.7059 | 0.8344 | 0.9134 |
| No log | 8.8636 | 390 | 0.8391 | 0.7059 | 0.8391 | 0.9160 |
| No log | 8.9091 | 392 | 0.8635 | 0.6908 | 0.8635 | 0.9293 |
| No log | 8.9545 | 394 | 0.8873 | 0.6811 | 0.8873 | 0.9420 |
| No log | 9.0 | 396 | 0.8982 | 0.6811 | 0.8982 | 0.9477 |
| No log | 9.0455 | 398 | 0.9033 | 0.6889 | 0.9033 | 0.9504 |
| No log | 9.0909 | 400 | 0.9248 | 0.6998 | 0.9248 | 0.9616 |
| No log | 9.1364 | 402 | 0.9538 | 0.6998 | 0.9538 | 0.9766 |
| No log | 9.1818 | 404 | 0.9850 | 0.6998 | 0.9850 | 0.9925 |
| No log | 9.2273 | 406 | 1.0120 | 0.6774 | 1.0120 | 1.0060 |
| No log | 9.2727 | 408 | 1.0326 | 0.6774 | 1.0326 | 1.0162 |
| No log | 9.3182 | 410 | 1.0308 | 0.6774 | 1.0308 | 1.0153 |
| No log | 9.3636 | 412 | 1.0208 | 0.6774 | 1.0208 | 1.0103 |
| No log | 9.4091 | 414 | 1.0040 | 0.6910 | 1.0040 | 1.0020 |
| No log | 9.4545 | 416 | 0.9920 | 0.6910 | 0.9920 | 0.9960 |
| No log | 9.5 | 418 | 0.9845 | 0.6833 | 0.9845 | 0.9922 |
| No log | 9.5455 | 420 | 0.9717 | 0.6922 | 0.9717 | 0.9857 |
| No log | 9.5909 | 422 | 0.9612 | 0.6922 | 0.9612 | 0.9804 |
| No log | 9.6364 | 424 | 0.9476 | 0.6922 | 0.9476 | 0.9734 |
| No log | 9.6818 | 426 | 0.9401 | 0.6922 | 0.9401 | 0.9696 |
| No log | 9.7273 | 428 | 0.9364 | 0.6922 | 0.9364 | 0.9677 |
| No log | 9.7727 | 430 | 0.9319 | 0.6922 | 0.9319 | 0.9653 |
| No log | 9.8182 | 432 | 0.9268 | 0.6922 | 0.9268 | 0.9627 |
| No log | 9.8636 | 434 | 0.9265 | 0.6922 | 0.9265 | 0.9626 |
| No log | 9.9091 | 436 | 0.9295 | 0.6922 | 0.9295 | 0.9641 |
| No log | 9.9545 | 438 | 0.9330 | 0.6922 | 0.9330 | 0.9659 |
| No log | 10.0 | 440 | 0.9344 | 0.6922 | 0.9344 | 0.9666 |
Framework versions
- Transformers 4.44.2
- Pytorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1
- Downloads last month
- -
Model tree for MayBashendy/ArabicNewSplits4_WithDuplicationsForScore5_FineTuningAraBERT_run3_AugV5_k13_task5_organization
Base model
aubmindlab/bert-base-arabertv02