BERT-base-cased Fine-Tuned on GLUE MRPC (Demo)

This checkpoint was initialized from the pre-trained checkpoint bert-base-cased and subsequently fine-tuned on the GLUE MRPC task using this notebook. Training ran for 3 epochs with a linearly decaying learning rate of 2e-05 and a total batch size of 32.
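A minimal sketch of how such a run could be reproduced with the 🤗 Transformers `Trainer` (whose default scheduler is linear decay) and the `datasets` library; the actual notebook may differ in details such as warmup, seed, or evaluation setup.

```python
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

# Load the MRPC subset of GLUE and the pre-trained checkpoint.
raw = load_dataset("glue", "mrpc")
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-cased", num_labels=2
)

def tokenize(batch):
    # MRPC is a sentence-pair task; encode both sentences together.
    return tokenizer(batch["sentence1"], batch["sentence2"], truncation=True)

tokenized = raw.map(tokenize, batched=True)

# Hyperparameters stated on this card: 3 epochs, learning rate 2e-05
# (linearly decayed by Trainer's default scheduler), batch size 32.
args = TrainingArguments(
    output_dir="bert-base-cased_fine_tuned_glue_mrpc_demo",
    num_train_epochs=3,
    learning_rate=2e-5,
    per_device_train_batch_size=32,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    tokenizer=tokenizer,
)
trainer.train()
```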

The model reaches a final training loss of 0.103 and an accuracy of 0.831.
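For inference, the checkpoint can be used for sentence-pair paraphrase classification via the `text-classification` pipeline. Note the label names (`LABEL_0`/`LABEL_1`) are an assumption based on the default GLUE MRPC label mapping, where 1 means "paraphrase".

```python
from transformers import pipeline

# Paraphrase classification with the published checkpoint.
classifier = pipeline(
    "text-classification",
    model="patrickvonplaten/bert-base-cased_fine_tuned_glue_mrpc_demo",
)

# Pass the two sentences as a text/text_pair dict.
pred = classifier({
    "text": "The company said revenue rose 5% last quarter.",
    "text_pair": "Quarterly revenue increased by five percent, the firm said.",
})
print(pred)  # e.g. {'label': 'LABEL_1', 'score': ...}
```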
