ArabicNewSplits7_FineTuningAraBERT_noAug_task3_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset (recorded as "None" by the auto-generated card). It achieves the following results on the evaluation set (see the loading sketch after this list):

  • Loss: 0.8090
  • Qwk: 0.1095
  • Mse: 0.8090
  • Rmse: 0.8994
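
The task head and label scheme are not documented here, but the Qwk/Mse/Rmse metrics point to a scoring task. Below is a minimal loading sketch, assuming the checkpoint exposes a standard sequence-classification head; the repo id is taken from this card, and the input text is a placeholder:

```python
# Minimal inference sketch; assumes a standard sequence-classification head.
# The number of labels and their meaning are not documented in this card.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

repo_id = "MayBashendy/ArabicNewSplits7_FineTuningAraBERT_noAug_task3_organization"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSequenceClassification.from_pretrained(repo_id)
model.eval()

text = "..."  # placeholder: an Arabic passage to score
inputs = tokenizer(text, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
print(logits)  # class scores or a single regression value, depending on the undocumented head
```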

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a Trainer sketch reproducing them follows the list):

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
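
A sketch of how these settings map onto transformers' TrainingArguments; the dataset, model head, and metric wiring are placeholders, since none of them are documented above. The listed Adam betas/epsilon are the Trainer defaults, and the 2-step evaluation cadence is read off the results table below.

```python
# Hedged reproduction of the listed hyperparameters via the Trainer API.
# train_ds / eval_ds are placeholders: the card does not document the data.
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

base = "aubmindlab/bert-base-arabertv02"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForSequenceClassification.from_pretrained(base)  # num_labels not documented

args = TrainingArguments(
    output_dir="ArabicNewSplits7_FineTuningAraBERT_noAug_task3_organization",
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=100,
    eval_strategy="steps",  # evaluation every 2 steps, matching the results table
    eval_steps=2,
)
# Adam with betas=(0.9, 0.999) and epsilon=1e-08 is the Trainer's default optimizer.

# trainer = Trainer(model=model, args=args, tokenizer=tokenizer,
#                   train_dataset=train_ds, eval_dataset=eval_ds)
# trainer.train()
```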

Training results

"No log" in the Training Loss column means the training loss was never recorded; with only 300 optimizer steps, the run likely ended before the Trainer's default logging interval was reached.

| Training Loss | Epoch | Step | Validation Loss | Qwk | Mse | Rmse |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:------:|
| No log | 0.6667 | 2 | 5.5420 | 0.0030 | 5.5420 | 2.3541 |
| No log | 1.3333 | 4 | 3.1254 | 0.0002 | 3.1254 | 1.7679 |
| No log | 2.0 | 6 | 1.7162 | 0.0213 | 1.7162 | 1.3100 |
| No log | 2.6667 | 8 | 1.2102 | -0.0133 | 1.2102 | 1.1001 |
| No log | 3.3333 | 10 | 1.2021 | -0.0178 | 1.2021 | 1.0964 |
| No log | 4.0 | 12 | 0.8865 | 0.0748 | 0.8865 | 0.9416 |
| No log | 4.6667 | 14 | 0.9694 | 0.0623 | 0.9694 | 0.9846 |
| No log | 5.3333 | 16 | 0.7347 | 0.0260 | 0.7347 | 0.8571 |
| No log | 6.0 | 18 | 0.6818 | 0.0807 | 0.6818 | 0.8257 |
| No log | 6.6667 | 20 | 0.8893 | 0.1113 | 0.8893 | 0.9430 |
| No log | 7.3333 | 22 | 0.8323 | 0.0951 | 0.8323 | 0.9123 |
| No log | 8.0 | 24 | 0.7651 | 0.0884 | 0.7651 | 0.8747 |
| No log | 8.6667 | 26 | 0.7550 | 0.1620 | 0.7550 | 0.8689 |
| No log | 9.3333 | 28 | 0.8338 | 0.1592 | 0.8338 | 0.9131 |
| No log | 10.0 | 30 | 1.0524 | 0.1753 | 1.0524 | 1.0258 |
| No log | 10.6667 | 32 | 0.8584 | 0.1707 | 0.8584 | 0.9265 |
| No log | 11.3333 | 34 | 0.8218 | 0.1538 | 0.8218 | 0.9066 |
| No log | 12.0 | 36 | 1.0187 | 0.0764 | 1.0187 | 1.0093 |
| No log | 12.6667 | 38 | 1.0979 | 0.1246 | 1.0979 | 1.0478 |
| No log | 13.3333 | 40 | 0.8634 | 0.1201 | 0.8634 | 0.9292 |
| No log | 14.0 | 42 | 0.9601 | 0.1066 | 0.9601 | 0.9798 |
| No log | 14.6667 | 44 | 1.4010 | 0.1290 | 1.4010 | 1.1836 |
| No log | 15.3333 | 46 | 1.4809 | 0.1175 | 1.4809 | 1.2169 |
| No log | 16.0 | 48 | 1.0711 | 0.0995 | 1.0711 | 1.0349 |
| No log | 16.6667 | 50 | 1.1059 | 0.0600 | 1.1059 | 1.0516 |
| No log | 17.3333 | 52 | 1.3003 | 0.0056 | 1.3003 | 1.1403 |
| No log | 18.0 | 54 | 1.1468 | 0.0584 | 1.1468 | 1.0709 |
| No log | 18.6667 | 56 | 1.0487 | 0.0365 | 1.0487 | 1.0240 |
| No log | 19.3333 | 58 | 0.9480 | 0.0930 | 0.9480 | 0.9736 |
| No log | 20.0 | 60 | 0.9161 | 0.0856 | 0.9161 | 0.9572 |
| No log | 20.6667 | 62 | 0.8637 | 0.1529 | 0.8637 | 0.9293 |
| No log | 21.3333 | 64 | 0.8521 | 0.1529 | 0.8521 | 0.9231 |
| No log | 22.0 | 66 | 1.0071 | 0.0225 | 1.0071 | 1.0036 |
| No log | 22.6667 | 68 | 1.1649 | 0.0866 | 1.1649 | 1.0793 |
| No log | 23.3333 | 70 | 0.9809 | 0.0993 | 0.9809 | 0.9904 |
| No log | 24.0 | 72 | 0.9642 | 0.1705 | 0.9642 | 0.9819 |
| No log | 24.6667 | 74 | 0.9449 | 0.1638 | 0.9449 | 0.9721 |
| No log | 25.3333 | 76 | 0.9330 | 0.0851 | 0.9330 | 0.9659 |
| No log | 26.0 | 78 | 1.0442 | 0.0799 | 1.0442 | 1.0219 |
| No log | 26.6667 | 80 | 0.9999 | 0.0856 | 0.9999 | 1.0000 |
| No log | 27.3333 | 82 | 0.8117 | 0.1841 | 0.8117 | 0.9010 |
| No log | 28.0 | 84 | 0.7927 | 0.0834 | 0.7927 | 0.8904 |
| No log | 28.6667 | 86 | 0.8194 | 0.1702 | 0.8194 | 0.9052 |
| No log | 29.3333 | 88 | 0.8495 | 0.0923 | 0.8495 | 0.9217 |
| No log | 30.0 | 90 | 0.8640 | 0.0923 | 0.8640 | 0.9295 |
| No log | 30.6667 | 92 | 0.8946 | 0.0884 | 0.8946 | 0.9458 |
| No log | 31.3333 | 94 | 0.8656 | 0.2254 | 0.8656 | 0.9304 |
| No log | 32.0 | 96 | 0.8592 | 0.1313 | 0.8592 | 0.9269 |
| No log | 32.6667 | 98 | 0.9560 | 0.1027 | 0.9560 | 0.9778 |
| No log | 33.3333 | 100 | 0.9772 | 0.0217 | 0.9772 | 0.9885 |
| No log | 34.0 | 102 | 0.8714 | 0.0805 | 0.8714 | 0.9335 |
| No log | 34.6667 | 104 | 0.8205 | 0.0846 | 0.8205 | 0.9058 |
| No log | 35.3333 | 106 | 0.9025 | 0.1700 | 0.9025 | 0.9500 |
| No log | 36.0 | 108 | 0.9035 | 0.1131 | 0.9035 | 0.9505 |
| No log | 36.6667 | 110 | 0.9686 | 0.0933 | 0.9686 | 0.9842 |
| No log | 37.3333 | 112 | 1.1821 | 0.0704 | 1.1821 | 1.0872 |
| No log | 38.0 | 114 | 1.4063 | 0.0884 | 1.4063 | 1.1859 |
| No log | 38.6667 | 116 | 1.3013 | 0.0798 | 1.3013 | 1.1407 |
| No log | 39.3333 | 118 | 1.0749 | 0.1144 | 1.0749 | 1.0368 |
| No log | 40.0 | 120 | 0.9422 | 0.1010 | 0.9422 | 0.9707 |
| No log | 40.6667 | 122 | 0.8785 | 0.0361 | 0.8785 | 0.9373 |
| No log | 41.3333 | 124 | 0.8177 | 0.0700 | 0.8177 | 0.9043 |
| No log | 42.0 | 126 | 0.8195 | 0.0091 | 0.8195 | 0.9053 |
| No log | 42.6667 | 128 | 0.8602 | 0.0684 | 0.8602 | 0.9275 |
| No log | 43.3333 | 130 | 0.8793 | 0.0986 | 0.8793 | 0.9377 |
| No log | 44.0 | 132 | 0.8323 | 0.1190 | 0.8323 | 0.9123 |
| No log | 44.6667 | 134 | 0.8088 | 0.0597 | 0.8088 | 0.8993 |
| No log | 45.3333 | 136 | 0.8169 | 0.0597 | 0.8169 | 0.9038 |
| No log | 46.0 | 138 | 0.8986 | 0.1437 | 0.8986 | 0.9479 |
| No log | 46.6667 | 140 | 0.9207 | 0.1301 | 0.9207 | 0.9595 |
| No log | 47.3333 | 142 | 0.8466 | 0.0504 | 0.8466 | 0.9201 |
| No log | 48.0 | 144 | 0.7896 | 0.1240 | 0.7896 | 0.8886 |
| No log | 48.6667 | 146 | 0.7947 | 0.0840 | 0.7947 | 0.8914 |
| No log | 49.3333 | 148 | 0.8111 | 0.1518 | 0.8111 | 0.9006 |
| No log | 50.0 | 150 | 0.9068 | 0.0842 | 0.9068 | 0.9523 |
| No log | 50.6667 | 152 | 1.0347 | 0.0799 | 1.0347 | 1.0172 |
| No log | 51.3333 | 154 | 1.0026 | 0.1285 | 1.0026 | 1.0013 |
| No log | 52.0 | 156 | 0.9318 | 0.1353 | 0.9318 | 0.9653 |
| No log | 52.6667 | 158 | 0.9047 | 0.1050 | 0.9047 | 0.9512 |
| No log | 53.3333 | 160 | 0.8876 | 0.1465 | 0.8876 | 0.9421 |
| No log | 54.0 | 162 | 0.8811 | 0.0923 | 0.8811 | 0.9387 |
| No log | 54.6667 | 164 | 0.8536 | 0.0660 | 0.8536 | 0.9239 |
| No log | 55.3333 | 166 | 0.8457 | 0.1049 | 0.8457 | 0.9196 |
| No log | 56.0 | 168 | 0.8319 | 0.0660 | 0.8319 | 0.9121 |
| No log | 56.6667 | 170 | 0.8353 | 0.1095 | 0.8353 | 0.9139 |
| No log | 57.3333 | 172 | 0.8510 | 0.1003 | 0.8510 | 0.9225 |
| No log | 58.0 | 174 | 0.8389 | 0.0660 | 0.8389 | 0.9159 |
| No log | 58.6667 | 176 | 0.8407 | 0.0725 | 0.8407 | 0.9169 |
| No log | 59.3333 | 178 | 0.8471 | 0.0688 | 0.8471 | 0.9204 |
| No log | 60.0 | 180 | 0.8592 | 0.0408 | 0.8592 | 0.9269 |
| No log | 60.6667 | 182 | 0.8563 | 0.1094 | 0.8563 | 0.9254 |
| No log | 61.3333 | 184 | 0.8624 | 0.0964 | 0.8624 | 0.9287 |
| No log | 62.0 | 186 | 0.8726 | 0.0435 | 0.8726 | 0.9341 |
| No log | 62.6667 | 188 | 0.8523 | 0.0113 | 0.8523 | 0.9232 |
| No log | 63.3333 | 190 | 0.8311 | 0.1415 | 0.8311 | 0.9116 |
| No log | 64.0 | 192 | 0.8214 | 0.1465 | 0.8214 | 0.9063 |
| No log | 64.6667 | 194 | 0.8305 | 0.1094 | 0.8305 | 0.9113 |
| No log | 65.3333 | 196 | 0.8404 | 0.1094 | 0.8404 | 0.9167 |
| No log | 66.0 | 198 | 0.8687 | 0.1752 | 0.8687 | 0.9320 |
| No log | 66.6667 | 200 | 0.8926 | 0.1673 | 0.8926 | 0.9448 |
| No log | 67.3333 | 202 | 0.9188 | 0.0392 | 0.9188 | 0.9585 |
| No log | 68.0 | 204 | 0.8798 | 0.1673 | 0.8798 | 0.9380 |
| No log | 68.6667 | 206 | 0.8201 | 0.1094 | 0.8201 | 0.9056 |
| No log | 69.3333 | 208 | 0.7988 | 0.1094 | 0.7988 | 0.8937 |
| No log | 70.0 | 210 | 0.7839 | 0.1094 | 0.7839 | 0.8854 |
| No log | 70.6667 | 212 | 0.7808 | 0.1292 | 0.7808 | 0.8836 |
| No log | 71.3333 | 214 | 0.7814 | 0.1815 | 0.7814 | 0.8840 |
| No log | 72.0 | 216 | 0.7779 | 0.1292 | 0.7779 | 0.8820 |
| No log | 72.6667 | 218 | 0.7787 | 0.1240 | 0.7787 | 0.8824 |
| No log | 73.3333 | 220 | 0.7864 | 0.0660 | 0.7864 | 0.8868 |
| No log | 74.0 | 222 | 0.8027 | 0.1485 | 0.8027 | 0.8960 |
| No log | 74.6667 | 224 | 0.8140 | 0.1485 | 0.8140 | 0.9022 |
| No log | 75.3333 | 226 | 0.8172 | 0.1485 | 0.8172 | 0.9040 |
| No log | 76.0 | 228 | 0.8171 | 0.1485 | 0.8171 | 0.9040 |
| No log | 76.6667 | 230 | 0.8079 | 0.1095 | 0.8079 | 0.8988 |
| No log | 77.3333 | 232 | 0.7940 | 0.1189 | 0.7940 | 0.8911 |
| No log | 78.0 | 234 | 0.7891 | 0.1751 | 0.7891 | 0.8883 |
| No log | 78.6667 | 236 | 0.7894 | 0.1282 | 0.7894 | 0.8885 |
| No log | 79.3333 | 238 | 0.7957 | 0.0893 | 0.7957 | 0.8920 |
| No log | 80.0 | 240 | 0.7966 | 0.0893 | 0.7966 | 0.8925 |
| No log | 80.6667 | 242 | 0.7885 | 0.1282 | 0.7885 | 0.8880 |
| No log | 81.3333 | 244 | 0.7826 | 0.1689 | 0.7826 | 0.8847 |
| No log | 82.0 | 246 | 0.8029 | 0.1095 | 0.8029 | 0.8960 |
| No log | 82.6667 | 248 | 0.8306 | 0.1277 | 0.8306 | 0.9114 |
| No log | 83.3333 | 250 | 0.8493 | 0.1188 | 0.8493 | 0.9216 |
| No log | 84.0 | 252 | 0.8483 | 0.1188 | 0.8483 | 0.9210 |
| No log | 84.6667 | 254 | 0.8302 | 0.0959 | 0.8302 | 0.9112 |
| No log | 85.3333 | 256 | 0.8121 | 0.0959 | 0.8121 | 0.9012 |
| No log | 86.0 | 258 | 0.8024 | 0.1095 | 0.8024 | 0.8958 |
| No log | 86.6667 | 260 | 0.7921 | 0.1095 | 0.7921 | 0.8900 |
| No log | 87.3333 | 262 | 0.7805 | 0.1722 | 0.7805 | 0.8835 |
| No log | 88.0 | 264 | 0.7776 | 0.1722 | 0.7776 | 0.8818 |
| No log | 88.6667 | 266 | 0.7779 | 0.1722 | 0.7779 | 0.8820 |
| No log | 89.3333 | 268 | 0.7797 | 0.1722 | 0.7797 | 0.8830 |
| No log | 90.0 | 270 | 0.7864 | 0.1196 | 0.7864 | 0.8868 |
| No log | 90.6667 | 272 | 0.7953 | 0.1095 | 0.7953 | 0.8918 |
| No log | 91.3333 | 274 | 0.8012 | 0.1095 | 0.8012 | 0.8951 |
| No log | 92.0 | 276 | 0.8062 | 0.1095 | 0.8062 | 0.8979 |
| No log | 92.6667 | 278 | 0.8098 | 0.1095 | 0.8098 | 0.8999 |
| No log | 93.3333 | 280 | 0.8182 | 0.1095 | 0.8182 | 0.9045 |
| No log | 94.0 | 282 | 0.8247 | 0.1324 | 0.8247 | 0.9082 |
| No log | 94.6667 | 284 | 0.8266 | 0.1324 | 0.8266 | 0.9092 |
| No log | 95.3333 | 286 | 0.8239 | 0.1324 | 0.8239 | 0.9077 |
| No log | 96.0 | 288 | 0.8172 | 0.0611 | 0.8172 | 0.9040 |
| No log | 96.6667 | 290 | 0.8103 | 0.0611 | 0.8103 | 0.9002 |
| No log | 97.3333 | 292 | 0.8083 | 0.1095 | 0.8083 | 0.8991 |
| No log | 98.0 | 294 | 0.8087 | 0.1095 | 0.8087 | 0.8993 |
| No log | 98.6667 | 296 | 0.8090 | 0.0611 | 0.8090 | 0.8995 |
| No log | 99.3333 | 298 | 0.8091 | 0.1095 | 0.8091 | 0.8995 |
| No log | 100.0 | 300 | 0.8090 | 0.1095 | 0.8090 | 0.8994 |
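
Validation Loss and Mse are identical at every step, which suggests an MSE (regression-style) training objective. The metric columns can be reproduced with scikit-learn under the usual reading of Qwk as Cohen's kappa with quadratic weights; a sketch with placeholder scores:

```python
# Sketch of the table's metric columns; assumes Qwk means Cohen's kappa with
# quadratic weights over integer score labels, and uses placeholder data.
import numpy as np
from sklearn.metrics import cohen_kappa_score, mean_squared_error

y_true = np.array([3, 2, 4, 1, 0])  # placeholder gold scores
y_pred = np.array([2, 2, 3, 1, 1])  # placeholder predictions (rounded, if the head regresses)

qwk = cohen_kappa_score(y_true, y_pred, weights="quadratic")
mse = mean_squared_error(y_true, y_pred)
rmse = float(np.sqrt(mse))
print(f"Qwk={qwk:.4f}  Mse={mse:.4f}  Rmse={rmse:.4f}")
```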

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Checkpoint format

  • Safetensors, about 0.1B parameters, stored as F32