che2

This model is a fine-tuned version of openai-community/gpt2 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
No log	1.0	1	2.9747
No log	2.0	2	2.9277
No log	3.0	3	2.8863
No log	4.0	4	2.8503
No log	5.0	5	2.8192
No log	6.0	6	2.7933
No log	7.0	7	2.7725
No log	8.0	8	2.7572
No log	9.0	9	2.7469
No log	10.0	10	2.7417

Safetensors

Model size

0.1B params

Tensor type

F32

Base model

Finetuned

this model