Synonym ARXIV Models
Collection
3 items • Updated
This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on an arxiv abstracts dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 1.1013 | 0.8768 | 500 | 3.1391 | 0.5853 |
| 0.8318 | 1.7536 | 1000 | 2.4346 | 0.6784 |
| 0.4265 | 2.6304 | 1500 | 2.5032 | 0.6838 |
Base model
meta-llama/Meta-Llama-3-8B