gpt2_m050_tiny-stories_1024_dpos

This model is a fine-tuned version of on the roneneldan/TinyStories dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Accuracy
2.9069	0.0525	1000	2.4458	0.4478
1.9731	0.1049	2000	1.7931	0.5686
1.7234	0.1574	3000	1.6119	0.6009
1.6063	0.2099	4000	1.5118	0.6192
1.5331	0.2623	5000	1.4537	0.6299
1.4812	0.3148	6000	1.4059	0.6392
1.4428	0.3672	7000	1.3720	0.6457
1.4149	0.4197	8000	1.3438	0.6510
1.3857	0.4722	9000	1.3179	0.6564
1.3654	0.5246	10000	1.2988	0.6600
1.3449	0.5771	11000	1.2830	0.6630
1.3302	0.6296	12000	1.2688	0.6660
1.3174	0.6820	13000	1.2575	0.6683
1.3052	0.7345	14000	1.2457	0.6708
1.2959	0.7869	15000	1.2371	0.6725
1.2847	0.8394	16000	1.2278	0.6743
1.28	0.8919	17000	1.2206	0.6759
1.27	0.9443	18000	1.2162	0.6768
1.272	0.9968	19000	1.2129	0.6776

Safetensors

Model size

0.1B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support