PoSH-Bench collection: the models I trained for the PoSH-Bench paper (44 items).
This model was trained from scratch on an unknown dataset. At the last logged evaluation (step 100000, epoch 21.67) it reaches a validation loss of 3.7715 and an accuracy of 0.3332.
More information needed
The hyperparameters used during training are not listed. Training results, evaluated every 2000 steps:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 8.283 | 0.43 | 2000 | 8.1606 | 0.1193 |
| 5.6979 | 0.87 | 4000 | 6.2471 | 0.1794 |
| 4.7397 | 1.3 | 6000 | 5.5927 | 0.2038 |
| 4.4648 | 1.73 | 8000 | 5.3240 | 0.2134 |
| 4.304 | 2.17 | 10000 | 5.1259 | 0.2224 |
| 4.1746 | 2.6 | 12000 | 4.9563 | 0.2346 |
| 4.0534 | 3.03 | 14000 | 4.8167 | 0.2452 |
| 3.9553 | 3.47 | 16000 | 4.6929 | 0.2529 |
| 3.8577 | 3.9 | 18000 | 4.5889 | 0.2604 |
| 3.7753 | 4.33 | 20000 | 4.5094 | 0.2662 |
| 3.6998 | 4.77 | 22000 | 4.4154 | 0.2721 |
| 3.6369 | 5.2 | 24000 | 4.3482 | 0.2773 |
| 3.5876 | 5.64 | 26000 | 4.2805 | 0.2822 |
| 3.5287 | 6.07 | 28000 | 4.2261 | 0.2864 |
| 3.4839 | 6.5 | 30000 | 4.1755 | 0.2907 |
| 3.4458 | 6.94 | 32000 | 4.1339 | 0.2953 |
| 3.4049 | 7.37 | 34000 | 4.1012 | 0.2991 |
| 3.369 | 7.8 | 36000 | 4.0649 | 0.3019 |
| 3.3284 | 8.24 | 38000 | 4.0342 | 0.3044 |
| 3.316 | 8.67 | 40000 | 4.0054 | 0.3075 |
| 3.2823 | 9.1 | 42000 | 3.9841 | 0.3102 |
| 3.258 | 9.54 | 44000 | 3.9635 | 0.3127 |
| 3.2452 | 9.97 | 46000 | 3.9409 | 0.3150 |
| 3.2171 | 10.4 | 48000 | 3.9224 | 0.3161 |
| 3.2129 | 10.84 | 50000 | 3.9127 | 0.3176 |
| 3.1892 | 11.27 | 52000 | 3.8949 | 0.3196 |
| 3.1881 | 11.7 | 54000 | 3.8819 | 0.3205 |
| 3.1666 | 12.14 | 56000 | 3.8752 | 0.3220 |
| 3.1579 | 12.57 | 58000 | 3.8623 | 0.3230 |
| 3.167 | 13.0 | 60000 | 3.8587 | 0.3239 |
| 3.1434 | 13.44 | 62000 | 3.8529 | 0.3233 |
| 3.1465 | 13.87 | 64000 | 3.8404 | 0.3256 |
| 3.1154 | 14.3 | 66000 | 3.8405 | 0.3253 |
| 3.1263 | 14.74 | 68000 | 3.8275 | 0.3269 |
| 3.1178 | 15.17 | 70000 | 3.8242 | 0.3271 |
| 3.1115 | 15.6 | 72000 | 3.8151 | 0.3280 |
| 3.1085 | 16.04 | 74000 | 3.8117 | 0.3287 |
| 3.0968 | 16.47 | 76000 | 3.8081 | 0.3288 |
| 3.1017 | 16.91 | 78000 | 3.8005 | 0.3298 |
| 3.0839 | 17.34 | 80000 | 3.8010 | 0.3296 |
| 3.0855 | 17.77 | 82000 | 3.7976 | 0.3305 |
| 3.0794 | 18.21 | 84000 | 3.7902 | 0.3313 |
| 3.0728 | 18.64 | 86000 | 3.7906 | 0.3309 |
| 3.0754 | 19.07 | 88000 | 3.7862 | 0.3313 |
| 3.0675 | 19.51 | 90000 | 3.7835 | 0.3321 |
| 3.0707 | 19.94 | 92000 | 3.7769 | 0.3330 |
| 3.0606 | 20.37 | 94000 | 3.7770 | 0.3324 |
| 3.0614 | 20.81 | 96000 | 3.7753 | 0.3327 |
| 3.0507 | 21.24 | 98000 | 3.7728 | 0.3330 |
| 3.0538 | 21.67 | 100000 | 3.7715 | 0.3332 |
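
The numbers in the table can be turned into more interpretable quantities. The sketch below assumes the validation loss is a mean per-token cross-entropy in nats, which the card does not state, so the perplexity values are only meaningful if that assumption holds. It also estimates the number of optimizer steps per epoch from the Step and Epoch columns. The helper names `perplexity` and `steps_per_epoch` are illustrative and not part of any training code referenced by the card.

```python
import math

# (step, epoch, validation_loss, accuracy): three rows copied from the table above
rows = [
    (2000, 0.43, 8.1606, 0.1193),
    (50000, 10.84, 3.9127, 0.3176),
    (100000, 21.67, 3.7715, 0.3332),
]


def perplexity(loss_nats: float) -> float:
    """Perplexity, ASSUMING the loss is a mean cross-entropy measured in nats."""
    return math.exp(loss_nats)


def steps_per_epoch(step: int, epoch: float) -> float:
    """Rough steps-per-epoch estimate from one (step, epoch) pair of the log."""
    return step / epoch


for step, epoch, val_loss, acc in rows:
    print(f"step {step:>6}: perplexity = {perplexity(val_loss):.1f}, accuracy = {acc:.4f}")

final_step, final_epoch, _, _ = rows[-1]
print(f"approx. steps per epoch: {steps_per_epoch(final_step, final_epoch):.0f}")
```

Because the Epoch column is rounded to two decimals, the steps-per-epoch figure is an approximation; using the last row keeps the rounding error small.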