micro_base_help_tapt_pretrain_model

This model is a fine-tuned version of roberta-base on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
1.9109	0.99	40	1.6849
1.7421	2.0	81	1.6620
1.7411	2.99	121	1.6333
1.6441	4.0	162	1.6306
1.6337	4.99	202	1.6137
1.5774	6.0	243	1.6343
1.5997	6.99	283	1.5931
1.5196	8.0	324	1.6018
1.5416	8.99	364	1.5994
1.4819	10.0	405	1.5886
1.5079	10.99	445	1.5938
1.455	12.0	486	1.5699
1.4718	12.99	526	1.5947
1.4157	14.0	567	1.5920
1.4369	14.99	607	1.5879
1.3733	16.0	648	1.5745
1.4017	16.99	688	1.6000
1.3601	18.0	729	1.5830
1.3602	18.99	769	1.5846
1.3152	20.0	810	1.5940
1.3437	20.99	850	1.5942
1.2904	22.0	891	1.5787

Safetensors

Model size

0.1B params

Tensor type

F32

Base model

Finetuned

this model

Finetunes