BBC News Classification with BERT

Overview

This model classifies BBC news articles into five categories: Business, Entertainment, Politics, Sport, and Tech. It is fine-tuned using a pre-trained BERT model to achieve high accuracy in text classification.

Dataset

The model is trained on the BBC News dataset, which consists of categorized news articles. The dataset contains training and test splits.

  • Train Dataset: Contains labeled news articles.
  • Test Dataset: Contains unlabeled news articles for evaluation.

Model Details

  • Base Model: bert-base-uncased
  • Training Framework: PyTorch with Hugging Face Transformers
  • Batch Size: 16
  • Optimizer: Adam with Learning Rate Scheduling

How to Use

You can use this model to classify BBC news articles into one of the five categories. Below is an example using the Hugging Face Transformers library:

from transformers import pipeline

classifier = pipeline("text-classification", model="NotThareesh/BBC-News-Classifier-BERT")

labels = {"LABEL_0": "Business", "LABEL_1": "Entertainment", "LABEL_2": "Politics", "LABEL_3": "Sport", "LABEL_4": "Tech"}

text = "The stock market saw a significant rise today after tech companies reported high earnings."
result = classifier(text)

print(f"Predicted Label: {labels[result[0]['label']]}, Accuracy Score: {result[0]['score']}")  # Predicted Label: Business, Accuracy: 0.996

Model Performance

  • Accuracy: Achieved over 99% accuracy on the validation dataset.
  • Evaluation: The model performs consistently on Precision, Recall, and F1-score.

License

This model is available under the MIT License.

Downloads last month
8
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ItsThareesh/BBC-News-Classifier-BERT

Finetuned
(6377)
this model