ArabicNewSplits8_FineTuningAraBERT_noAug_task8_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02; the fine-tuning dataset is not specified in this card. It achieves the following results on the evaluation set:

  • Loss: 1.0004
  • Qwk: 0.2955
  • Mse: 1.0004
  • Rmse: 1.0002
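
Qwk here is the quadratic weighted Cohen's kappa, and Rmse is simply the square root of Mse (the matching Loss and Mse values suggest an MSE training objective). For reference, a minimal sketch of how these metrics can be computed with scikit-learn, using placeholder arrays rather than the actual evaluation data:

```python
import numpy as np
from sklearn.metrics import cohen_kappa_score, mean_squared_error

# Placeholder labels/predictions standing in for the real evaluation data.
y_true = np.array([0, 1, 2, 2, 1, 0])
y_pred = np.array([0, 2, 2, 1, 1, 1])

qwk = cohen_kappa_score(y_true, y_pred, weights="quadratic")  # Qwk
mse = mean_squared_error(y_true, y_pred)                      # Mse
rmse = np.sqrt(mse)                                           # Rmse
print(f"Qwk: {qwk:.4f}  Mse: {mse:.4f}  Rmse: {rmse:.4f}")
```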

Model description

More information needed

Intended uses & limitations

More information needed
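
In the absence of documented uses, the checkpoint can at least be loaded like any other AraBERT fine-tune. A minimal sketch, assuming a sequence-classification head (suggested by the Qwk/Mse/Rmse metrics reported above):

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification

repo_id = "MayBashendy/ArabicNewSplits8_FineTuningAraBERT_noAug_task8_organization"

# Assumption: a classification/regression head on top of AraBERT,
# as suggested by the Qwk/Mse/Rmse metrics reported in this card.
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSequenceClassification.from_pretrained(repo_id)

inputs = tokenizer("جملة عربية للتجربة", return_tensors="pt")
logits = model(**inputs).logits
print(logits)
```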

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
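
These settings map directly onto a standard transformers TrainingArguments object; a minimal sketch, assuming the usual Trainer workflow (the Adam betas/epsilon and the linear scheduler listed above are the library defaults, so they need no explicit flags):

```python
from transformers import TrainingArguments

# Mirrors the hyperparameters listed above. The output_dir name is
# illustrative; betas=(0.9, 0.999), epsilon=1e-08, and the linear
# scheduler are the transformers defaults.
training_args = TrainingArguments(
    output_dir="arabert_task8_organization",  # hypothetical path
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=100,
)
```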

Training results

Training Loss | Epoch   | Step | Validation Loss | Qwk     | Mse    | Rmse
------------- | ------- | ---- | --------------- | ------- | ------ | ------
No log | 0.6667 | 2 | 2.5117 | -0.0807 | 2.5117 | 1.5848
No log | 1.3333 | 4 | 2.0361 | -0.0620 | 2.0361 | 1.4269
No log | 2.0 | 6 | 0.8033 | -0.1484 | 0.8033 | 0.8963
No log | 2.6667 | 8 | 0.6657 | 0.1522 | 0.6657 | 0.8159
No log | 3.3333 | 10 | 0.7476 | 0.1171 | 0.7476 | 0.8646
No log | 4.0 | 12 | 0.8463 | 0.1212 | 0.8463 | 0.9200
No log | 4.6667 | 14 | 1.1200 | 0.0540 | 1.1200 | 1.0583
No log | 5.3333 | 16 | 0.9783 | 0.0604 | 0.9783 | 0.9891
No log | 6.0 | 18 | 0.7732 | 0.1919 | 0.7732 | 0.8793
No log | 6.6667 | 20 | 0.6523 | 0.1823 | 0.6523 | 0.8076
No log | 7.3333 | 22 | 0.6895 | -0.0801 | 0.6895 | 0.8304
No log | 8.0 | 24 | 0.7802 | 0.0562 | 0.7802 | 0.8833
No log | 8.6667 | 26 | 0.9955 | 0.2273 | 0.9955 | 0.9978
No log | 9.3333 | 28 | 1.1632 | 0.2887 | 1.1632 | 1.0785
No log | 10.0 | 30 | 1.1028 | 0.3104 | 1.1028 | 1.0501
No log | 10.6667 | 32 | 0.8632 | 0.2121 | 0.8632 | 0.9291
No log | 11.3333 | 34 | 0.6933 | 0.1897 | 0.6933 | 0.8326
No log | 12.0 | 36 | 0.6854 | 0.1898 | 0.6854 | 0.8279
No log | 12.6667 | 38 | 0.7182 | 0.3194 | 0.7182 | 0.8475
No log | 13.3333 | 40 | 0.9164 | 0.2988 | 0.9164 | 0.9573
No log | 14.0 | 42 | 0.9065 | 0.3357 | 0.9065 | 0.9521
No log | 14.6667 | 44 | 0.7855 | 0.3113 | 0.7855 | 0.8863
No log | 15.3333 | 46 | 0.6585 | 0.2997 | 0.6585 | 0.8115
No log | 16.0 | 48 | 0.6990 | 0.2796 | 0.6990 | 0.8360
No log | 16.6667 | 50 | 0.9037 | 0.2276 | 0.9037 | 0.9506
No log | 17.3333 | 52 | 1.2544 | 0.2032 | 1.2544 | 1.1200
No log | 18.0 | 54 | 1.2434 | 0.1964 | 1.2434 | 1.1151
No log | 18.6667 | 56 | 1.0558 | 0.2189 | 1.0558 | 1.0275
No log | 19.3333 | 58 | 0.8351 | 0.2353 | 0.8351 | 0.9138
No log | 20.0 | 60 | 0.8223 | 0.2558 | 0.8223 | 0.9068
No log | 20.6667 | 62 | 0.9749 | 0.2880 | 0.9749 | 0.9874
No log | 21.3333 | 64 | 1.1415 | 0.3008 | 1.1415 | 1.0684
No log | 22.0 | 66 | 0.9713 | 0.2918 | 0.9713 | 0.9855
No log | 22.6667 | 68 | 0.7554 | 0.2069 | 0.7554 | 0.8692
No log | 23.3333 | 70 | 0.6930 | 0.3096 | 0.6930 | 0.8325
No log | 24.0 | 72 | 0.7932 | 0.2695 | 0.7932 | 0.8906
No log | 24.6667 | 74 | 0.7840 | 0.2614 | 0.7840 | 0.8854
No log | 25.3333 | 76 | 0.8342 | 0.2957 | 0.8342 | 0.9134
No log | 26.0 | 78 | 0.8112 | 0.2827 | 0.8112 | 0.9007
No log | 26.6667 | 80 | 0.8207 | 0.3113 | 0.8207 | 0.9059
No log | 27.3333 | 82 | 0.7375 | 0.2280 | 0.7375 | 0.8588
No log | 28.0 | 84 | 0.7279 | 0.2650 | 0.7279 | 0.8532
No log | 28.6667 | 86 | 0.7244 | 0.2655 | 0.7244 | 0.8511
No log | 29.3333 | 88 | 0.8306 | 0.3188 | 0.8306 | 0.9114
No log | 30.0 | 90 | 0.9885 | 0.2878 | 0.9885 | 0.9943
No log | 30.6667 | 92 | 0.8922 | 0.2917 | 0.8922 | 0.9446
No log | 31.3333 | 94 | 0.9035 | 0.2917 | 0.9035 | 0.9505
No log | 32.0 | 96 | 1.0021 | 0.2735 | 1.0021 | 1.0011
No log | 32.6667 | 98 | 0.9690 | 0.2917 | 0.9690 | 0.9844
No log | 33.3333 | 100 | 0.8474 | 0.2794 | 0.8474 | 0.9205
No log | 34.0 | 102 | 0.8138 | 0.2794 | 0.8138 | 0.9021
No log | 34.6667 | 104 | 0.7705 | 0.3337 | 0.7705 | 0.8778
No log | 35.3333 | 106 | 0.7601 | 0.3724 | 0.7601 | 0.8718
No log | 36.0 | 108 | 0.7928 | 0.3633 | 0.7928 | 0.8904
No log | 36.6667 | 110 | 0.9021 | 0.2735 | 0.9021 | 0.9498
No log | 37.3333 | 112 | 1.0088 | 0.2918 | 1.0088 | 1.0044
No log | 38.0 | 114 | 1.1008 | 0.3169 | 1.1008 | 1.0492
No log | 38.6667 | 116 | 1.0015 | 0.2807 | 1.0015 | 1.0008
No log | 39.3333 | 118 | 0.8040 | 0.2435 | 0.8040 | 0.8967
No log | 40.0 | 120 | 0.7497 | 0.2193 | 0.7497 | 0.8658
No log | 40.6667 | 122 | 0.8077 | 0.2580 | 0.8077 | 0.8987
No log | 41.3333 | 124 | 0.9472 | 0.2735 | 0.9472 | 0.9732
No log | 42.0 | 126 | 1.0166 | 0.2360 | 1.0166 | 1.0083
No log | 42.6667 | 128 | 0.9108 | 0.2568 | 0.9108 | 0.9544
No log | 43.3333 | 130 | 0.8074 | 0.2778 | 0.8074 | 0.8986
No log | 44.0 | 132 | 0.7930 | 0.2549 | 0.7930 | 0.8905
No log | 44.6667 | 134 | 0.8844 | 0.2838 | 0.8844 | 0.9405
No log | 45.3333 | 136 | 0.9994 | 0.2847 | 0.9994 | 0.9997
No log | 46.0 | 138 | 0.9571 | 0.2807 | 0.9571 | 0.9783
No log | 46.6667 | 140 | 0.8175 | 0.2661 | 0.8175 | 0.9041
No log | 47.3333 | 142 | 0.7359 | 0.1789 | 0.7359 | 0.8578
No log | 48.0 | 144 | 0.7433 | 0.2392 | 0.7433 | 0.8622
No log | 48.6667 | 146 | 0.8684 | 0.2829 | 0.8684 | 0.9319
No log | 49.3333 | 148 | 0.9718 | 0.2763 | 0.9718 | 0.9858
No log | 50.0 | 150 | 0.9654 | 0.2763 | 0.9654 | 0.9826
No log | 50.6667 | 152 | 0.9904 | 0.2361 | 0.9904 | 0.9952
No log | 51.3333 | 154 | 0.9908 | 0.2361 | 0.9908 | 0.9954
No log | 52.0 | 156 | 1.0190 | 0.2235 | 1.0190 | 1.0095
No log | 52.6667 | 158 | 1.0193 | 0.2235 | 1.0193 | 1.0096
No log | 53.3333 | 160 | 0.9817 | 0.2235 | 0.9817 | 0.9908
No log | 54.0 | 162 | 0.9674 | 0.2763 | 0.9674 | 0.9835
No log | 54.6667 | 164 | 0.9690 | 0.2763 | 0.9690 | 0.9844
No log | 55.3333 | 166 | 0.9702 | 0.2763 | 0.9702 | 0.9850
No log | 56.0 | 168 | 0.9545 | 0.2878 | 0.9545 | 0.9770
No log | 56.6667 | 170 | 0.8396 | 0.2603 | 0.8396 | 0.9163
No log | 57.3333 | 172 | 0.7622 | 0.2305 | 0.7622 | 0.8730
No log | 58.0 | 174 | 0.7645 | 0.2219 | 0.7645 | 0.8744
No log | 58.6667 | 176 | 0.7979 | 0.2743 | 0.7979 | 0.8933
No log | 59.3333 | 178 | 0.8517 | 0.2713 | 0.8517 | 0.9229
No log | 60.0 | 180 | 0.9212 | 0.2841 | 0.9212 | 0.9598
No log | 60.6667 | 182 | 0.9409 | 0.2841 | 0.9409 | 0.9700
No log | 61.3333 | 184 | 0.9640 | 0.2958 | 0.9640 | 0.9818
No log | 62.0 | 186 | 0.9679 | 0.3036 | 0.9679 | 0.9838
No log | 62.6667 | 188 | 0.9240 | 0.2794 | 0.9240 | 0.9613
No log | 63.3333 | 190 | 0.8674 | 0.2873 | 0.8674 | 0.9313
No log | 64.0 | 192 | 0.8299 | 0.2243 | 0.8299 | 0.9110
No log | 64.6667 | 194 | 0.8511 | 0.2470 | 0.8511 | 0.9225
No log | 65.3333 | 196 | 0.8475 | 0.2860 | 0.8475 | 0.9206
No log | 66.0 | 198 | 0.8892 | 0.3113 | 0.8892 | 0.9430
No log | 66.6667 | 200 | 0.9355 | 0.3257 | 0.9355 | 0.9672
No log | 67.3333 | 202 | 0.9396 | 0.3257 | 0.9396 | 0.9693
No log | 68.0 | 204 | 0.9777 | 0.3096 | 0.9777 | 0.9888
No log | 68.6667 | 206 | 0.9740 | 0.3096 | 0.9740 | 0.9869
No log | 69.3333 | 208 | 0.9540 | 0.2837 | 0.9540 | 0.9767
No log | 70.0 | 210 | 0.9092 | 0.2992 | 0.9092 | 0.9535
No log | 70.6667 | 212 | 0.8920 | 0.2716 | 0.8920 | 0.9445
No log | 71.3333 | 214 | 0.8950 | 0.2716 | 0.8950 | 0.9460
No log | 72.0 | 216 | 0.8813 | 0.2584 | 0.8813 | 0.9388
No log | 72.6667 | 218 | 0.9206 | 0.3114 | 0.9206 | 0.9595
No log | 73.3333 | 220 | 0.9686 | 0.2955 | 0.9686 | 0.9842
No log | 74.0 | 222 | 0.9831 | 0.3063 | 0.9831 | 0.9915
No log | 74.6667 | 224 | 0.9489 | 0.2955 | 0.9489 | 0.9741
No log | 75.3333 | 226 | 0.8739 | 0.2794 | 0.8739 | 0.9348
No log | 76.0 | 228 | 0.7898 | 0.2294 | 0.7898 | 0.8887
No log | 76.6667 | 230 | 0.7504 | 0.2383 | 0.7504 | 0.8663
No log | 77.3333 | 232 | 0.7555 | 0.2383 | 0.7555 | 0.8692
No log | 78.0 | 234 | 0.7812 | 0.2538 | 0.7812 | 0.8838
No log | 78.6667 | 236 | 0.7941 | 0.2538 | 0.7941 | 0.8911
No log | 79.3333 | 238 | 0.8395 | 0.2778 | 0.8395 | 0.9162
No log | 80.0 | 240 | 0.8695 | 0.2877 | 0.8695 | 0.9325
No log | 80.6667 | 242 | 0.9130 | 0.3000 | 0.9130 | 0.9555
No log | 81.3333 | 244 | 0.9249 | 0.3000 | 0.9249 | 0.9617
No log | 82.0 | 246 | 0.9074 | 0.3000 | 0.9074 | 0.9526
No log | 82.6667 | 248 | 0.8658 | 0.2743 | 0.8658 | 0.9305
No log | 83.3333 | 250 | 0.8488 | 0.2743 | 0.8488 | 0.9213
No log | 84.0 | 252 | 0.8631 | 0.2743 | 0.8631 | 0.9290
No log | 84.6667 | 254 | 0.8688 | 0.2743 | 0.8688 | 0.9321
No log | 85.3333 | 256 | 0.9024 | 0.3000 | 0.9024 | 0.9499
No log | 86.0 | 258 | 0.9258 | 0.3000 | 0.9258 | 0.9622
No log | 86.6667 | 260 | 0.9610 | 0.3144 | 0.9610 | 0.9803
No log | 87.3333 | 262 | 1.0029 | 0.3245 | 1.0029 | 1.0015
No log | 88.0 | 264 | 1.0071 | 0.3245 | 1.0071 | 1.0035
No log | 88.6667 | 266 | 0.9814 | 0.3144 | 0.9814 | 0.9907
No log | 89.3333 | 268 | 0.9567 | 0.2955 | 0.9567 | 0.9781
No log | 90.0 | 270 | 0.9322 | 0.2955 | 0.9322 | 0.9655
No log | 90.6667 | 272 | 0.9202 | 0.3114 | 0.9202 | 0.9593
No log | 91.3333 | 274 | 0.9127 | 0.3114 | 0.9127 | 0.9554
No log | 92.0 | 276 | 0.9228 | 0.2955 | 0.9228 | 0.9606
No log | 92.6667 | 278 | 0.9361 | 0.2955 | 0.9361 | 0.9675
No log | 93.3333 | 280 | 0.9555 | 0.2955 | 0.9555 | 0.9775
No log | 94.0 | 282 | 0.9718 | 0.2955 | 0.9718 | 0.9858
No log | 94.6667 | 284 | 0.9804 | 0.2955 | 0.9804 | 0.9902
No log | 95.3333 | 286 | 0.9946 | 0.3144 | 0.9946 | 0.9973
No log | 96.0 | 288 | 1.0082 | 0.3245 | 1.0082 | 1.0041
No log | 96.6667 | 290 | 1.0183 | 0.3245 | 1.0183 | 1.0091
No log | 97.3333 | 292 | 1.0184 | 0.3245 | 1.0184 | 1.0092
No log | 98.0 | 294 | 1.0130 | 0.3245 | 1.0130 | 1.0065
No log | 98.6667 | 296 | 1.0065 | 0.3144 | 1.0065 | 1.0033
No log | 99.3333 | 298 | 1.0023 | 0.3144 | 1.0023 | 1.0011
No log | 100.0 | 300 | 1.0004 | 0.2955 | 1.0004 | 1.0002
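
The run overfits: validation loss bottoms out at 0.6523 around epoch 6.7, while the best Qwk (0.3724) comes at epoch 35.3, so the final epoch-100 checkpoint is optimal by neither metric. A minimal sketch of selecting a best epoch from a Trainer-style log history (the eval_qwk key is hypothetical and depends on the compute_metrics function used in training):

```python
# Rows taken from the table above, in the dict format that
# trainer.state.log_history typically uses for eval records.
log_history = [
    {"epoch": 6.6667, "eval_loss": 0.6523, "eval_qwk": 0.1823},
    {"epoch": 35.3333, "eval_loss": 0.7601, "eval_qwk": 0.3724},
    {"epoch": 100.0, "eval_loss": 1.0004, "eval_qwk": 0.2955},
]

best_qwk = max(log_history, key=lambda row: row["eval_qwk"])
best_loss = min(log_history, key=lambda row: row["eval_loss"])
print(best_qwk["epoch"], best_loss["epoch"])  # 35.3333 6.6667
```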

Framework versions

  • Transformers 4.44.2
  • PyTorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1