---
library_name: transformers
license: mit
base_model: jla25/results
tags:
  - generated_from_trainer
model-index:
  - name: square1.2
    results: []
---

square1.2

This model is a fine-tuned version of jla25/results on an unknown dataset. It reaches a final validation loss of 0.0074 on the evaluation set (epoch 10, step 750).

Model description

More information needed
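
Because no architecture is described here, one way to find out what the checkpoint contains is to read its configuration from the Hub. The snippet below is a minimal sketch, not the author's documented usage: `describe_checkpoint` is a hypothetical helper, and the repository id must be supplied by the caller because this card does not state the owner namespace.

```python
def describe_checkpoint(repo_id: str) -> dict:
    """Fetch only the config for `repo_id` and summarize the architecture.

    `repo_id` is an assumption supplied by the caller, e.g. "<owner>/square1.2".
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoConfig

    # Reads config.json for the repository; no model weights are loaded.
    config = AutoConfig.from_pretrained(repo_id)
    return {
        "model_type": config.model_type,
        "architectures": getattr(config, "architectures", None),
    }


# Usage (requires network access and a valid repository id):
# info = describe_checkpoint("<owner>/square1.2")
# print(info)
```

The `architectures` entry, when present, names the model class the checkpoint was saved with, which indicates which `AutoModelFor...` class is appropriate for loading the weights.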

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training ran for 10 epochs of 75 steps each (750 steps in total), with validation loss evaluated at the end of every epoch:

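| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| No log        | 1.0   | 75   | 0.0286          |
| No log        | 2.0   | 150  | 0.0172          |
| No log        | 3.0   | 225  | 0.0129          |
| No log        | 4.0   | 300  | 0.0103          |
| No log        | 5.0   | 375  | 0.0088          |
| No log        | 6.0   | 450  | 0.0082          |
| 0.0187        | 7.0   | 525  | 0.0078          |
| 0.0187        | 8.0   | 600  | 0.0076          |
| 0.0187        | 9.0   | 675  | 0.0074          |
| 0.0187        | 10.0  | 750  | 0.0074          |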
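
The validation losses above can be checked for convergence with a few lines of Python. This is an illustrative sketch only: the values are copied from the table, while the helper names and the 1% plateau threshold are assumptions chosen for the example, not settings used in training.

```python
# Validation loss per epoch, copied from the training results table.
val_loss = [0.0286, 0.0172, 0.0129, 0.0103, 0.0088,
            0.0082, 0.0078, 0.0076, 0.0074, 0.0074]
STEPS_PER_EPOCH = 75  # from the Step column: 75, 150, ..., 750


def relative_improvements(losses):
    """Fractional drop in loss from each epoch to the next."""
    return [(prev - cur) / prev for prev, cur in zip(losses, losses[1:])]


def first_plateau_epoch(losses, threshold=0.01):
    """First epoch (1-indexed) that improves on the previous epoch by less
    than `threshold`, or None if every epoch improves by at least that much.
    The 1% default is an arbitrary choice for illustration."""
    for epoch, gain in enumerate(relative_improvements(losses), start=2):
        if gain < threshold:
            return epoch
    return None


gains = relative_improvements(val_loss)
for epoch, gain in enumerate(gains, start=2):
    step = epoch * STEPS_PER_EPOCH
    print(f"epoch {epoch:2d} (step {step}): {gain:6.1%} lower than previous epoch")
print("plateau at epoch:", first_plateau_epoch(val_loss))
```

Under these assumptions the per-epoch gain shrinks from roughly 40% at epoch 2 to zero at epoch 10, so the run had effectively converged by the final epoch. The "No log" entries and the repeated 0.0187 training loss are consistent with the `Trainer` default of logging training loss every 500 steps: the first value would be recorded at step 500, and no second value would be recorded before training ended at step 750.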