# BERT-base-uncased fine-tuned on SST-2 (GLUE)
This repository contains a bert-base-uncased model fine-tuned for binary sentiment classification on the GLUE/SST-2 dataset.
## Model summary

- Task: sentiment analysis (binary classification)
- Labels: negative (`0`), positive (`1`)
- Base model: `bert-base-uncased`
- Library: Transformers (`Trainer` API)
- Note: In the training notebook, the model was fine-tuned on a small subset (640 train / 640 validation examples) for demonstration purposes. For production use, fine-tune on the full dataset and validate thoroughly.
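For quick experimentation, the checkpoint can be loaded through the `pipeline` API. A minimal sketch, assuming the fine-tuned weights are published on the Hub under this repository's ID (`mi55th/bert-sst2-nesterov`); substitute a local checkpoint path otherwise:

```python
from transformers import pipeline

# Load the fine-tuned checkpoint (assumed Hub ID; use a local path if needed)
classifier = pipeline(
    "sentiment-analysis",
    model="mi55th/bert-sst2-nesterov",
)

result = classifier("A touching and beautifully acted film.")[0]
print(result["label"], round(result["score"], 4))
```

Label names depend on the `id2label` mapping saved with the model; by default they may appear as `LABEL_0`/`LABEL_1` rather than `negative`/`positive`.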
## Intended uses

### ✅ Supported

- Quick demos of sentiment classification on English sentences
- Educational examples of fine-tuning with `Trainer`
- Baseline experiments on SST-2-like sentiment data
### ⚠️ Not recommended
- High-stakes or safety-critical decisions (medical, legal, hiring, etc.)
- Domains significantly different from SST-2 (e.g., clinical notes, finance news) without further fine-tuning
- Non-English text (model and data are English-focused)
## Limitations and biases
- Dataset bias: SST-2 reflects movie review sentiment distribution and language patterns; performance may degrade on other domains.
- Small fine-tuning subset: a model trained on only 640 examples is not representative of performance on the full SST-2 benchmark.
- Short-text behavior: very short/ambiguous or sarcastic statements can be misclassified.
- Offensive/toxic content: the model may output confident predictions on harmful text; it does not provide safety filtering.
## Training data

Fine-tuning used the GLUE benchmark, configuration SST-2 (Stanford Sentiment Treebank v2 as packaged in GLUE).

- Dataset: `glue`, config `sst2`
- Text field: `sentence`
- Label field: `label` (0/1)

In the provided Colab:

- train: `.select(range(640))`
- validation: `.select(range(640))`
- test: predictions generated without labels (the GLUE test split is unlabeled)
## Training procedure

### Preprocessing

- Tokenizer: `AutoTokenizer.from_pretrained("bert-base-uncased")`
- Truncation enabled (`truncation=True`)
- Dynamic padding via `DataCollatorWithPadding`
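These preprocessing steps amount to only a few lines. A sketch assuming the `transformers` library is installed, with the example sentences made up for illustration:

```python
from transformers import AutoTokenizer, DataCollatorWithPadding

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Truncate to the model's maximum length; padding is deferred to the collator
examples = [
    tokenizer(s, truncation=True)
    for s in ["a gripping drama", "dull"]
]

# Dynamic padding: each batch is padded only to its own longest sequence,
# not to a fixed global maximum length
collator = DataCollatorWithPadding(tokenizer=tokenizer)
batch = collator(examples)
print(batch["input_ids"].shape)
```

Dynamic padding keeps batches compact, which is why `truncation=True` is applied at tokenization time while padding is left to the collator.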
### Hyperparameters (from Colab)

- epochs: 3
- learning_rate: 2e-5
- batch_size: 16 (per device)
- weight_decay: 0.01
- evaluation: each epoch
- checkpointing: each epoch
- best model selection: accuracy on validation
- logging: disabled (`report_to="none"`)
## Results (validation)
- Accuracy: 0.8625
- Loss: 0.3392
## Model tree for mi55th/bert-sst2-nesterov

- Base model: `google-bert/bert-base-uncased`