yes_no_model / README.md

End of training

c095fce verified over 1 year ago

1.73 kB

metadata

license: mit
base_model: gpt2
tags:
  - generated_from_trainer
model-index:
  - name: yes_no_model
    results: []

yes_no_model

This model is a fine-tuned version of gpt2 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
6.4778	0.2857	10	5.2663
2.9544	0.5714	20	1.8252
1.267	0.8571	30	0.7955
0.7249	1.1429	40	0.3101
0.3031	1.4286	50	0.0483
0.0606	1.7143	60	0.0010
0.0049	2.0	70	0.0001
0.0014	2.2857	80	0.0001
0.0005	2.5714	90	0.0000
0.0008	2.8571	100	0.0000