distilbert-base-uncased-SpamFilter-DunnBC22

This model is a fine-tuned version of distilbert-base-uncased; the training dataset is not named in the card metadata. At the final epoch it achieves the following results on the evaluation set: loss 0.1007, accuracy 0.9907, F1 0.9906.

Model description

This model performs binary classification: it labels each input text as spam or not spam.
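A minimal inference sketch, assuming the checkpoint is published on the Hugging Face Hub. The repository id `DunnBC22/distilbert-base-uncased-SpamFilter` and the spam label name `LABEL_1` are assumptions, not stated in this card; check the model's `config.json` (`id2label`) before relying on them. `is_spam` and `load_classifier` are hypothetical helper names.

```python
# Assumed values: neither the Hub id nor the label name appears in the card.
ASSUMED_MODEL_ID = "DunnBC22/distilbert-base-uncased-SpamFilter"
ASSUMED_SPAM_LABEL = "LABEL_1"


def load_classifier(model_id=ASSUMED_MODEL_ID):
    """Build a text-classification pipeline (requires `transformers` and network access)."""
    from transformers import pipeline  # imported lazily so the helper below has no hard dependency

    return pipeline("text-classification", model=model_id)


def is_spam(text, classifier, spam_label=ASSUMED_SPAM_LABEL):
    """Return True if the classifier's top label for `text` is the spam label.

    `classifier` is any callable that, given a single string, returns a list
    with one {"label": ..., "score": ...} dict, which is what a
    text-classification pipeline returns for a single input.
    """
    top = classifier(text)[0]
    return top["label"] == spam_label
```

With `transformers` installed, `is_spam("Win a free prize now!", load_classifier())` would run the model on one message. Keeping the classifier as a parameter makes it easy to swap in a different checkpoint or a stub for testing.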

This model is intended to demonstrate my ability to solve a complex problem using technology.

The main limitation is the quality of the data source.

[Figure: Input Word Length By Class]

[Figure: Confusion Matrix]

The hyperparameters used during training are not listed here. Training results by epoch:

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
0.5039	1.0	7	0.3920	0.8333	0.7576
0.3008	2.0	14	0.2010	0.9722	0.9719
0.113	3.0	21	0.1007	0.9907	0.9906
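The Accuracy and F1 columns can be illustrated with a small pure-Python sketch. The card does not say how F1 is averaged or how large the evaluation set is. The example below assumes support-weighted F1 and a hypothetical evaluation set of 108 examples split 90/18 between the two classes. Under those assumptions, a classifier that always predicts the majority class reproduces the epoch-1 row (0.8333 accuracy, 0.7576 F1). That is consistent with the table, not confirmed by the source.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def weighted_f1(y_true, y_pred):
    """Per-class F1, averaged with weights proportional to each class's support."""
    support = Counter(y_true)
    total = 0.0
    for label, n in support.items():
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        total += f1 * n
    return total / len(y_true)


# Hypothetical evaluation set: 90 examples of class 0, 18 of class 1
# (sizes are an assumption; which class is spam is not stated in the card).
y_true = [0] * 90 + [1] * 18
# A baseline that always predicts the majority class.
y_pred = [0] * 108

acc = round(accuracy(y_true, y_pred), 4)
f1 = round(weighted_f1(y_true, y_pred), 4)
print(acc, f1)
```

Read this way, the jump between epoch 1 and epoch 2 would correspond to the model starting to predict the minority class at all, which is why F1 rises faster than accuracy.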

This model is a fine-tuned derivative of a pretrained model. Users must comply with the original model license.

This model was fine-tuned on third-party datasets which may have separate licenses or usage restrictions.