Upload DistilBERT IMDB sentiment model and results
- .gitattributes +3 -0
- MODEL_CARD.md +34 -0
- README.md +73 -0
- best_model.pt +3 -0
- checkpoint_epoch_1.pt +3 -0
- checkpoint_epoch_2.pt +3 -0
- checkpoint_epoch_3.pt +3 -0
- config.json +1 -0
- confusion_matrix.png +0 -0
- final_results.json +14 -0
- model.safetensors +3 -0
- special_tokens_map.json +7 -0
- test_data.csv +3 -0
- tokenizer.json +0 -0
- tokenizer_config.json +56 -0
- train_data.csv +3 -0
- training_history.csv +4 -0
- training_history.png +3 -0
- vocab.txt +0 -0
.gitattributes
CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+test_data.csv filter=lfs diff=lfs merge=lfs -text
+train_data.csv filter=lfs diff=lfs merge=lfs -text
+training_history.png filter=lfs diff=lfs merge=lfs -text
MODEL_CARD.md
ADDED
@@ -0,0 +1,34 @@
+
+# Sentiment Analysis Model Card
+
+## Model Description
+- **Base Model**: distilbert-base-uncased
+- **Task**: Binary Sentiment Classification (Positive/Negative)
+- **Dataset**: IMDB Movie Reviews
+- **Training Samples**: 16,000
+- **Validation Samples**: 4,000
+- **Test Samples**: 5,000
+
+## Performance
+- **Test Accuracy**: 0.9460
+- **Test F1 Score**: 0.9723
+- **Best Validation Accuracy**: 0.9300
+
+## Training Details
+- **Epochs**: 3
+- **Batch Size**: 16
+- **Learning Rate**: 2e-05
+- **Max Sequence Length**: 512
+- **Optimizer**: AdamW with weight decay
+- **Scheduler**: Linear with warmup
+
+## Model Size
+- **Total Parameters**: 66,955,010
+- **Trainable Parameters**: 66,955,010
+- **Frozen Parameters**: 0
+
+## Explainability Features
+- ✅ Attention weights available
+- ✅ Hidden states available
+- ✅ Compatible with LIME
+- ✅ Compatible with Integrated Gradients
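The explainability claims above amount to the standard `transformers` outputs; a minimal sketch of pulling attention weights and hidden states from this checkpoint (the repo id is the one used in the README below, the input text is an arbitrary example):

```python
# Sketch: request attentions and hidden states from the fine-tuned DistilBERT.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "Hums003/distilbert-imdb-sentiment"  # repo id from the README
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name, output_attentions=True, output_hidden_states=True
)

inputs = tokenizer("A quietly devastating film.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# DistilBERT has 6 transformer layers with 12 heads: one attention tensor per layer,
# and one hidden-state tensor per layer plus one for the embeddings.
print(len(outputs.attentions), outputs.attentions[0].shape)        # 6, (1, 12, seq, seq)
print(len(outputs.hidden_states), outputs.hidden_states[0].shape)  # 7, (1, seq, 768)
```

These tensors are what LIME and Integrated Gradients wrappers typically consume alongside the logits.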
README.md
ADDED
@@ -0,0 +1,73 @@
+---
+language: en
+tags:
+- sentiment-analysis
+- imdb
+- distilbert
+- transformers
+license: apache-2.0
+datasets:
+- imdb
+---
+
+# DistilBERT Sentiment Analysis Model
+
+This model is a fine-tuned version of `distilbert-base-uncased` for binary sentiment classification on the IMDB movie reviews dataset.
+
+## Model Details
+
+### Model Description
+- **Model type**: DistilBERT (transformer-based)
+- **Task**: Binary sentiment classification (positive/negative)
+- **Base Model**: `distilbert-base-uncased`
+- **Language**: English
+
+### Training Details
+
+#### Training Data
+- **Dataset**: IMDB Movie Reviews
+- **Training Samples**: 16,000
+- **Validation Samples**: 4,000
+- **Test Samples**: 5,000
+- **Class Distribution**: 50% positive, 50% negative
+
+#### Training Procedure
+- **Epochs**: 3
+- **Batch Size**: 16
+- **Learning Rate**: 2e-05
+- **Max Sequence Length**: 512
+- **Optimizer**: AdamW with weight decay (0.01)
+- **Scheduler**: Linear with 10% warmup
+
+#### Evaluation Results
+- **Test Accuracy**: 0.9460
+- **Test F1 Score**: 0.9723
+- **Best Validation Accuracy**: 0.9300
+- **Training Time**: ~6 minutes on Google Colab T4 GPU
+
+## How to Use
+
+### Direct Inference
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+
+# Load model and tokenizer
+model_name = "Hums003/distilbert-imdb-sentiment"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+
+# Prepare text
+text = "This movie was absolutely fantastic! I loved every minute of it."
+inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
+
+# Get predictions
+with torch.no_grad():
+    outputs = model(**inputs)
+    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
+
+# Interpret results
+sentiment = "positive" if predictions[0][1] > 0.5 else "negative"
+confidence = predictions[0][1].item() if predictions[0][1] > 0.5 else predictions[0][0].item()
+print(f"Sentiment: {sentiment} (confidence: {confidence:.2%})")
+```
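For quick experiments, the same checkpoint can also be wrapped in a `transformers` pipeline; a minimal sketch using the repo id from the README (the label strings returned depend on how the model config maps ids to labels, which is not shown in this diff):

```python
# Sketch: pipeline-based inference with the uploaded checkpoint.
from transformers import pipeline

classifier = pipeline(
    "sentiment-analysis",
    model="Hums003/distilbert-imdb-sentiment",  # repo id from the README
)

print(classifier("This movie was absolutely fantastic! I loved every minute of it."))
# e.g. [{'label': 'LABEL_1', 'score': 0.99}] -- the exact label name depends on the config
```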
best_model.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4135587a3b2cced4cf24cb7bdfce6550319454e29e04b614d19f8d325a089c2b
+size 267863289
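`best_model.pt` is a raw PyTorch checkpoint rather than a `transformers` save directory, and its size (~268 MB) is close to `model.safetensors`, so it most likely holds only the model weights. A hedged sketch of loading it, assuming it stores a plain `state_dict` (or a dict wrapping one) for the two-label classifier; the actual layout is not documented in this commit:

```python
# Sketch: load best_model.pt into a DistilBERT classifier.
# Assumption: the file holds a state_dict (possibly wrapped as {"model_state_dict": ...})
# for a 2-label sequence classifier; adjust the unwrapping if the layout differs.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)
checkpoint = torch.load("best_model.pt", map_location="cpu")
state_dict = checkpoint.get("model_state_dict", checkpoint)
model.load_state_dict(state_dict)
model.eval()
```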
checkpoint_epoch_1.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:76c882db6516246e9cd39459a08ad8daa156ffa5a4de9f802fd48adce97aefe9
+size 803596065
checkpoint_epoch_2.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0dae9ddc31cb82d31e98c08ff0e72e6c10391aef7b33dd2aa79f4bc9773e1700
+size 803596065
checkpoint_epoch_3.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5199ce79842d687a18597bc7032834b14e12e2ce2d7972c85d14b5a7c43f9c55
+size 803596065
config.json
ADDED
@@ -0,0 +1 @@
+{}
confusion_matrix.png
ADDED
final_results.json
ADDED
@@ -0,0 +1,14 @@
+{
+  "test_accuracy": 0.946,
+  "test_f1": 0.9722507708119219,
+  "best_val_accuracy": 0.93,
+  "best_epoch": 3,
+  "total_parameters": 66955010,
+  "trainable_parameters": 66955010,
+  "training_samples": 16000,
+  "test_samples": 5000,
+  "epochs_trained": 3,
+  "batch_size": 16,
+  "learning_rate": 2e-05,
+  "max_length": 512
+}
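Since `final_results.json` is plain JSON, the run summary can be read straight back into Python; a minimal sketch using the keys shown above:

```python
# Sketch: read the exported run summary back into Python.
import json

with open("final_results.json") as f:
    results = json.load(f)

print(f"test accuracy: {results['test_accuracy']:.4f}")
print(f"test F1:       {results['test_f1']:.4f}")
print(f"best val acc:  {results['best_val_accuracy']:.4f} (epoch {results['best_epoch']})")
```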
model.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bea2b246f5d9f7b9a685301de1e7c67118d91662dc5f5bb7b02f3f3075cbba86
+size 267832560
special_tokens_map.json
ADDED
@@ -0,0 +1,7 @@
+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}
test_data.csv
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c184d0f5f855a6637056dbff925779497b31363bc2b63e05c98228b94db91cb6
+size 32308848
tokenizer.json
ADDED
The diff for this file is too large to render. See raw diff.
tokenizer_config.json
ADDED
@@ -0,0 +1,56 @@
+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}
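The `added_tokens_decoder` block above pins the usual BERT-style special-token ids; a small sketch that checks them against the tokenizer saved in this repo (repo id taken from the README):

```python
# Sketch: confirm the special-token ids listed in added_tokens_decoder (0, 100, 101, 102, 103).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Hums003/distilbert-imdb-sentiment")
for token in ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]"]:
    print(token, tokenizer.convert_tokens_to_ids(token))
print("model_max_length:", tokenizer.model_max_length)  # 512 per this config
```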
train_data.csv
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0ba8ebadc77f655877928edf3fcd58f4c69b0f7ffe54ac3a377322d5c44e6927
+size 33226811
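The train/test splits are shipped as LFS-tracked CSVs; a hedged sketch of loading them with pandas (the column names `text` and `label` are assumptions, since the CSV schema is not shown in this diff):

```python
# Sketch: load the uploaded splits. Column names are assumed; inspect df.columns
# first if the actual schema differs.
import pandas as pd

train_df = pd.read_csv("train_data.csv")
test_df = pd.read_csv("test_data.csv")

print(train_df.shape, test_df.shape)      # expected: 16,000 train rows, 5,000 test rows
print(train_df.columns.tolist())
# print(train_df["label"].value_counts())  # should be roughly 50/50 per the README
```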
training_history.csv
ADDED
@@ -0,0 +1,4 @@
+epoch,train_loss,val_loss,val_accuracy,val_f1
+1,0.3066180157214403,0.21175865678861738,0.92525,0.9249337617530204
+2,0.15920162493363021,0.2598572481777519,0.91825,0.9188085103584
+3,0.08946884938236326,0.2924477435983717,0.93,0.9299135622176197
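`training_history.csv` is small enough to replot directly, which is presumably what `training_history.png` shows; a minimal matplotlib sketch over the columns in the file:

```python
# Sketch: replot the per-epoch training history from training_history.csv.
import pandas as pd
import matplotlib.pyplot as plt

history = pd.read_csv("training_history.csv")

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.plot(history["epoch"], history["train_loss"], marker="o", label="train loss")
ax1.plot(history["epoch"], history["val_loss"], marker="o", label="val loss")
ax1.set_xlabel("epoch"); ax1.set_ylabel("loss"); ax1.legend()

ax2.plot(history["epoch"], history["val_accuracy"], marker="o", label="val accuracy")
ax2.plot(history["epoch"], history["val_f1"], marker="o", label="val F1")
ax2.set_xlabel("epoch"); ax2.legend()

fig.tight_layout()
fig.savefig("training_history_replot.png")
```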
training_history.png
ADDED
Git LFS Details
vocab.txt
ADDED
The diff for this file is too large to render. See raw diff.