gpt2_for_whole_train_result_2_2bce

This model is a fine-tuned version of gpt2 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
1.1142	6.8817	50	0.6346	0.671	0.7454
0.32	13.7634	100	0.1126	0.9545	0.9586
0.0531	20.6452	150	0.0443	0.9865	0.9874
0.0176	27.5269	200	0.0374	0.9905	0.9911
0.0074	34.4086	250	0.0362	0.992	0.9925
0.0036	41.2903	300	0.0325	0.994	0.9944
0.0021	48.1720	350	0.0375	0.9935	0.9939
0.0013	55.0538	400	0.0340	0.994	0.9944
0.0008	61.9355	450	0.0327	0.9945	0.9948
0.0011	68.8172	500	0.0391	0.994	0.9944
0.0006	75.6989	550	0.0389	0.995	0.9953
0.0014	82.5806	600	0.0409	0.993	0.9935
0.0045	89.4624	650	0.0443	0.9945	0.9948
0.0008	96.3441	700	0.0376	0.9945	0.9949

Safetensors

Model size

0.1B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

this model