QA_BERT_15_epoch

This model was trained from scratch on an unspecified dataset (the card records the dataset as None). On the evaluation set it reaches a validation loss of 3.2610 after 15 epochs.

Model description

More information needed

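Hyperparameters are not listed. The training log records 15 epochs at 2 steps per epoch (30 steps in total); training loss was not logged, so only validation loss is available: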

Training Loss	Epoch	Step	Validation Loss
No log	1.0	2	5.4152
No log	2.0	4	4.5589
No log	3.0	6	4.0069
No log	4.0	8	3.7323
No log	5.0	10	3.6285
No log	6.0	12	3.5584
No log	7.0	14	3.4614
No log	8.0	16	3.4109
No log	9.0	18	3.3767
No log	10.0	20	3.3450
No log	11.0	22	3.3027
No log	12.0	24	3.2725
No log	13.0	26	3.2648
No log	14.0	28	3.2638
No log	15.0	30	3.2610
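
To see how quickly the validation loss flattens out, the per-epoch decrease can be computed directly from the log above. The following is a minimal sketch: the loss values are copied from the table, while the variable names (`val_loss`, `deltas`, `total_drop`, `first_five`) are illustrative and not part of the training code.

```python
# Validation loss per epoch, copied from the training log above (epochs 1..15).
val_loss = [5.4152, 4.5589, 4.0069, 3.7323, 3.6285, 3.5584, 3.4614, 3.4109,
            3.3767, 3.3450, 3.3027, 3.2725, 3.2648, 3.2638, 3.2610]

# Decrease in validation loss from each epoch to the next.
deltas = [round(prev - cur, 4) for prev, cur in zip(val_loss, val_loss[1:])]

# Overall drop, and the part of it that happens between epoch 1 and epoch 5.
total_drop = round(val_loss[0] - val_loss[-1], 4)
first_five = round(val_loss[0] - val_loss[4], 4)

for epoch, d in enumerate(deltas, start=2):
    print(f"epoch {epoch:2d}: -{d:.4f}")
print(f"total drop: {total_drop:.4f}")
print(f"share of drop reached by epoch 5: {first_five / total_drop:.0%}")
```

On these values the loss decreases at every epoch, with roughly 83% of the total drop reached by epoch 5 and changes of under 0.01 per epoch from epoch 13 onward.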

Safetensors
Model size: 0.1B params
Tensor type: F32
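
The parameter count and tensor type above give a rough estimate of the weight size. This is a back-of-the-envelope sketch, assuming the rounded figure of 0.1B parameters and 4 bytes per F32 value; `estimate_weight_bytes` and `BYTES_PER_DTYPE` are hypothetical helpers, not part of any library, and the real checkpoint size will differ because 0.1B is itself rounded.

```python
# Bytes per stored value for a few common tensor types.
BYTES_PER_DTYPE = {"F32": 4, "F16": 2, "BF16": 2}


def estimate_weight_bytes(num_params: float, dtype: str = "F32") -> float:
    """Approximate size of the raw weights: parameter count times bytes per value."""
    return num_params * BYTES_PER_DTYPE[dtype]


# Assumption: 0.1B is the rounded count shown on the card, not the exact one.
approx_params = 0.1e9
size_gb = estimate_weight_bytes(approx_params, "F32") / 1e9
print(f"approximate weight size: {size_gb:.1f} GB")
```

Storing the same weights in a 2-byte type such as F16 would halve this estimate.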