Text Classification
Transformers
TensorBoard
Safetensors
bert
Generated from Trainer
text-embeddings-inference
Instructions to use BluSerK/bert-BBc-classifier with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use BluSerK/bert-BBc-classifier with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="BluSerK/bert-BBc-classifier")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("BluSerK/bert-BBc-classifier") model = AutoModelForSequenceClassification.from_pretrained("BluSerK/bert-BBc-classifier") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -8,10 +8,21 @@ model-index:
|
|
| 8 |
- name: bert-BBc-classifier
|
| 9 |
results: []
|
| 10 |
---
|
|
|
|
| 11 |
|
| 12 |
-
|
| 13 |
-
should probably proofread and complete it, then remove this comment. -->
|
| 14 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
# bert-BBc-classifier
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the None dataset.
|
|
|
|
| 8 |
- name: bert-BBc-classifier
|
| 9 |
results: []
|
| 10 |
---
|
| 11 |
+
# BERT News Category Classifier
|
| 12 |
|
| 13 |
+
This model is a fine-tuned version of `bert-base-uncased` optimized to classify articles into 5 categories (Business, Tech, Politics, Sports, Entertainment).
|
|
|
|
| 14 |
|
| 15 |
+
## Model Description
|
| 16 |
+
* **Architecture:** BERT-base-uncased with frozen base layers for training efficiency.
|
| 17 |
+
* **Task:** Multi-class Text Classification (NLP Pipeline).
|
| 18 |
+
* **Performance:** Achieved a 0.96 Macro-F1 score on evaluation.
|
| 19 |
+
|
| 20 |
+
## Training and Evaluation Data
|
| 21 |
+
* **Dataset:** BBC News Dataset.
|
| 22 |
+
* **Preprocessing:** Cleaned text fields tokenized using the standard BERT WordPiece tokenizer.
|
| 23 |
+
|
| 24 |
+
## Intended Uses & Limitations
|
| 25 |
+
This model is intended for production-ready news classification pipelines. It is lightweight due to layer-freezing optimization during training.
|
| 26 |
# bert-BBc-classifier
|
| 27 |
|
| 28 |
This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the None dataset.
|