This model was trained from scratch on an unknown dataset. After the final epoch (epoch 10), it reaches a validation loss of 3.7814 and an accuracy of 0.3704 on the evaluation set.
More information needed
The training hyperparameters are not listed. Results at the end of each epoch:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 5.0339 | 1.0 | 6439 | 4.8144 | 0.2765 |
| 4.4123 | 2.0 | 12878 | 4.2657 | 0.3180 |
| 4.1405 | 3.0 | 19317 | 4.0689 | 0.3368 |
| 4.011 | 4.0 | 25756 | 3.9679 | 0.3468 |
| 3.9116 | 5.0 | 32195 | 3.9068 | 0.3538 |
| 3.84 | 6.0 | 38634 | 3.8596 | 0.3597 |
| 3.7769 | 7.0 | 45073 | 3.8291 | 0.3634 |
| 3.7239 | 8.0 | 51512 | 3.8062 | 0.3666 |
| 3.6741 | 9.0 | 57951 | 3.7892 | 0.3687 |
| 3.632 | 10.0 | 64390 | 3.7814 | 0.3704 |
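
The validation losses above can be read as perplexities, which are often easier to compare across language models. The sketch below is illustrative only: it assumes the reported loss is a mean per-token cross-entropy in nats (this card does not state the loss function), and the names `rows` and `perplexity` are hypothetical helpers, not part of the training code.

```python
import math

# (epoch, step, validation_loss, accuracy), copied from the table above.
rows = [
    (1, 6439, 4.8144, 0.2765),
    (2, 12878, 4.2657, 0.3180),
    (3, 19317, 4.0689, 0.3368),
    (4, 25756, 3.9679, 0.3468),
    (5, 32195, 3.9068, 0.3538),
    (6, 38634, 3.8596, 0.3597),
    (7, 45073, 3.8291, 0.3634),
    (8, 51512, 3.8062, 0.3666),
    (9, 57951, 3.7892, 0.3687),
    (10, 64390, 3.7814, 0.3704),
]


def perplexity(loss_nats: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss_nats)


# The step column implies a constant number of optimizer steps per epoch.
steps_per_epoch = rows[0][1]

for epoch, step, val_loss, acc in rows:
    # Sanity check: cumulative steps should be epoch * steps_per_epoch.
    assert step == epoch * steps_per_epoch
    print(f"epoch {epoch:2d}  val_loss {val_loss:.4f}  "
          f"perplexity {perplexity(val_loss):7.2f}  accuracy {acc:.4f}")

final_ppl = perplexity(rows[-1][2])

# Improvement in validation loss between consecutive epochs.
deltas = [prev[2] - cur[2] for prev, cur in zip(rows, rows[1:])]
```

Under that assumption, the final validation loss corresponds to a perplexity of roughly 44. The per-epoch improvements in `deltas` shrink steadily, with the last epoch improving the loss by under 0.01, which suggests training was close to a plateau at epoch 10.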