Thareesh Prabakaran commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -6,4 +6,56 @@ pipeline_tag: text-classification
|
|
| 6 |
tags:
|
| 7 |
- bbc-news-classification
|
| 8 |
- bert
|
| 9 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
tags:
|
| 7 |
- bbc-news-classification
|
| 8 |
- bert
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# BBC News Classification with BERT
|
| 12 |
+
|
| 13 |
+
## Overview
|
| 14 |
+
|
| 15 |
+
This model classifies BBC news articles into five categories: Business, Entertainment, Politics, Sport, and Tech. It is fine-tuned using a pre-trained BERT model to achieve high accuracy in text classification.
|
| 16 |
+
|
| 17 |
+
## Dataset
|
| 18 |
+
|
| 19 |
+
The model is trained on the BBC News dataset, which consists of categorized news articles. The dataset contains training and test splits.
|
| 20 |
+
|
| 21 |
+
- Train Dataset: Contains labeled news articles.
|
| 22 |
+
|
| 23 |
+
- Test Dataset: Contains unlabeled news articles for evaluation.
|
| 24 |
+
|
| 25 |
+
## Model Details
|
| 26 |
+
|
| 27 |
+
- Base Model: bert-base-uncased
|
| 28 |
+
|
| 29 |
+
- Task: Text Classification
|
| 30 |
+
|
| 31 |
+
- Training Framework: TensorFlow with Hugging Face Transformers
|
| 32 |
+
|
| 33 |
+
- Fine-Tuning Epochs: 5
|
| 34 |
+
|
| 35 |
+
- Batch Size: 16
|
| 36 |
+
|
| 37 |
+
- Optimizer: Adam with Learning Rate Scheduling
|
| 38 |
+
|
| 39 |
+
## How to Use
|
| 40 |
+
|
| 41 |
+
You can use this model to classify BBC news articles into one of the five categories. Below is an example using the Hugging Face Transformers library:
|
| 42 |
+
|
| 43 |
+
```python
|
| 44 |
+
from transformers import pipeline
|
| 45 |
+
|
| 46 |
+
classifier = pipeline("text-classification", model="NotThareesh/BBC-News-Classifier-BERT")
|
| 47 |
+
|
| 48 |
+
text = "The stock market saw a significant rise today after tech companies reported high earnings."
|
| 49 |
+
result = classifier(text)
|
| 50 |
+
print(result) # Output: [{'label': 'Business', 'score': 0.98}]
|
| 51 |
+
```
|
| 52 |
+
|
| 53 |
+
## Model Performance
|
| 54 |
+
|
| 55 |
+
- Accuracy: Achieved over 99% accuracy on the validation dataset.
|
| 56 |
+
|
| 57 |
+
- Evaluation: The model performs consistently on Precision, Recall, and F1-score.
|
| 58 |
+
|
| 59 |
+
## License
|
| 60 |
+
|
| 61 |
+
This model is available under the MIT License.
|