BERT Email Spam Classifier

This repository contains a fine-tuned BERT model for SMS/email spam detection using the SMS Spam Collection Dataset.

Accuracy:

Loss: 0.012 Accuracy: 0.98

Model Details

Architecture: BERT (Bidirectional Encoder Representations from Transformers)
Task: Binary classification (spam vs. ham)
Base Model: bert-base-uncased
Framework: Hugging Face Transformers (PyTorch)
License: MIT

Usage

Load the model and tokenizer using Hugging Face Transformers:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("SGHOSH1999/bert-email-spam-classifier_tuned")
model = AutoModelForSequenceClassification.from_pretrained("SGHOSH1999/bert-email-spam-classifier_tuned")

Training Data

SMS Spam Collection Dataset
The dataset contains 5,574 SMS messages labeled as 'ham' (legitimate) or 'spam'.

How to Cite

If you use this model, please cite the original dataset and this repository.

License

MIT

Downloads last month: 331

Safetensors

Model size

0.1B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

SGHOSH1999
/

bert-email-spam-classifier_tuned