Tajik Language Models
Collection
20 items
•
Updated
•
3
This model is a fine-tuned version of gpt2 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 7.1107 | 1.0 | 2405 | 6.9547 |
| 6.7012 | 2.0 | 4810 | 6.6086 |
| 6.5467 | 3.0 | 7215 | 6.5076 |