Function_words
Collection
Models for various function word manipulations
•
18 items
•
Updated
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 5.0366 | 1.0 | 6439 | 4.8202 | 0.2746 |
| 4.4144 | 2.0 | 12878 | 4.2730 | 0.3176 |
| 4.1479 | 3.0 | 19317 | 4.0742 | 0.3366 |
| 4.0152 | 4.0 | 25756 | 3.9702 | 0.3473 |
| 3.9157 | 5.0 | 32195 | 3.9046 | 0.3545 |
| 3.8427 | 6.0 | 38634 | 3.8617 | 0.3588 |
| 3.7786 | 7.0 | 45073 | 3.8290 | 0.3639 |
| 3.7255 | 8.0 | 51512 | 3.8044 | 0.3661 |
| 3.6782 | 9.0 | 57951 | 3.7879 | 0.3691 |
| 3.6357 | 10.0 | 64390 | 3.7802 | 0.3704 |