stat

Files changed (8) hide show

README.md +145 -0
confusion_matrices.png +0 -0
experiment_summary.json +22 -0
metrics_summary.json +108 -0
model.pt +3 -0
per_class_metrics.png +0 -0
test_accuracy_comparison.png +0 -0
training_curves.png +0 -0

README.md ADDED Viewed

	@@ -0,0 +1,145 @@

+# Hindi Sentiment Analysis Model
+This repository contains a Hindi sentiment analysis model that can classify text into three categories: negative (neg), neutral (neu), and positive (pos). The model has been trained and evaluated using various BERT-based architectures, with XLM-RoBERTa showing the best performance.
+## Model Performance
+### Test Accuracy Comparison
+![Test Accuracy Comparison](./test_accuracy_comparison.png)
+Our extensive evaluation shows:
+- XLM-RoBERTa: 81.3%
+- mBERT: 76.5%
+- Custom-BERT-Attention: 74.9%
+- IndicBERT: 69.9%
+### Detailed Results
+#### Confusion Matrices
+![Confusion Matrices](./confusion_matrices.png)
+The confusion matrices show the prediction performance for each model:
+- XLM-RoBERTa shows the strongest performance with 82.1% accuracy on positive class
+- mBERT demonstrates balanced performance across classes
+- Custom-BERT-Attention maintains consistent performance
+- IndicBERT shows room for improvement in negative class detection
+#### Per-class Metrics
+![Per-class Metrics](./per_class_metrics.png)
+The detailed per-class metrics show:
+1. Precision:
+   - Positive class: Best performance across all models (~0.80-0.85)
+   - Neutral class: Consistent performance (~0.75-0.80)
+   - Negative class: More varied performance (~0.40-0.70)
+2. Recall:
+   - Positive class: High recall across models (~0.85-0.90)
+   - Neutral class: Moderate recall (~0.65-0.85)
+   - Negative class: Lower but improving recall (~0.25-0.60)
+3. F1-Score:
+   - Positive class: Best overall performance (~0.80-0.85)
+   - Neutral class: Good balance (~0.70-0.80)
+   - Negative class: Area for potential improvement (~0.30-0.65)
+### Training Progress
+![Training Progress](./training_progress.png)
+The training graphs show:
+- Consistent loss reduction across epochs
+- Stable validation accuracy improvement
+- No significant overfitting
+- XLM-RoBERTa achieving the best validation accuracy
+- Custom-BERT-Attention showing rapid initial learning
+## Model Usage
+```python
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+# Load the model and tokenizer
+tokenizer = AutoTokenizer.from_pretrained("madhav112/hindi-sentiment-analysis")
+model = AutoModelForSequenceClassification.from_pretrained("madhav112/hindi-sentiment-analysis")
+# Example usage
+text = "यह फिल्म बहुत अच्छी है"
+inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True)
+outputs = model(**inputs)
+predictions = outputs.logits.argmax(-1)
+```
+## Model Architecture
+The repository contains experiments with multiple BERT-based architectures:
+1. XLM-RoBERTa (Best performing)
+   - Highest overall accuracy
+   - Best performance on positive sentiment
+   - Strong cross-lingual capabilities
+2. mBERT
+   - Good balanced performance
+   - Strong on neutral class detection
+   - Consistent across all metrics
+3. Custom-BERT-Attention
+   - Competitive performance
+   - Quick convergence during training
+   - Good precision on positive class
+4. IndicBERT
+   - Baseline performance
+   - Room for improvement
+   - Better suited for specific Indian language tasks
+## Dataset
+The model was trained on a Hindi sentiment analysis dataset with three classes:
+- Positive (pos)
+- Neutral (neu)
+- Negative (neg)
+The confusion matrices show balanced class distribution and strong performance across categories.
+## Training Details
+The model was trained for 7 epochs with the following characteristics:
+- Learning rate: Optimized for each architecture
+- Batch size: Adjusted for optimal performance
+- Validation split: Regular evaluation during training
+- Early stopping: Monitored for best model selection
+- Loss function: Cross-entropy loss
+## Limitations
+- Lower performance on negative sentiment detection compared to positive
+- Neutral class classification shows moderate confusion with both positive and negative
+- Performance may vary on domain-specific text
+- Best suited for standard Hindi text; may have reduced performance on heavily colloquial or dialectal variations
+## Citation
+If you use this model in your research, please cite:
+```bibtex
+@misc{madhav2024hindisentiment,
+  author = {Madhav},
+  title = {Hindi Sentiment Analysis Model},
+  year = {2024},
+  publisher = {HuggingFace},
+  howpublished = {\url{https://huggingface.co/madhav112/hindi-sentiment-analysis}}
+}
+```
+## Author
+**Madhav**
+- HuggingFace: [madhav](https://huggingface.co/madhav)
+## License
+This project is licensed under the MIT License - see the LICENSE file for details.
+## Acknowledgments
+Special thanks to the HuggingFace team and the open-source community for providing the tools and frameworks that made this model possible.

confusion_matrices.png ADDED Viewed

experiment_summary.json ADDED Viewed

	@@ -0,0 +1,22 @@

+{
+    "best_model": "XLM-RoBERTa",
+    "best_accuracy": 81.33333333333333,
+    "model_rankings": [
+        [
+            "XLM-RoBERTa",
+            81.33333333333333
+        ],
+        [
+            "mBERT",
+            76.53333333333333
+        ],
+        [
+            "Custom-BERT-Attention",
+            74.93333333333334
+        ],
+        [
+            "IndicBERT",
+            69.86666666666666
+        ]
+    ]
+}

metrics_summary.json ADDED Viewed

	@@ -0,0 +1,108 @@

+{
+    "model_comparisons": {
+        "IndicBERT": {
+            "test_accuracy": 69.86666666666666,
+            "avg_precision": 0.6224180162184014,
+            "avg_recall": 0.5884580801343807,
+            "avg_f1": 0.593856658862321,
+            "per_class_metrics": {
+                "neg": {
+                    "precision": 0.4,
+                    "recall": 0.23076923076923078,
+                    "f1-score": 0.29268292682926833,
+                    "support": 52.0
+                },
+                "neu": {
+                    "precision": 0.7709923664122137,
+                    "recall": 0.6733333333333333,
+                    "f1-score": 0.7188612099644129,
+                    "support": 150.0
+                },
+                "pos": {
+                    "precision": 0.6962616822429907,
+                    "recall": 0.861271676300578,
+                    "f1-score": 0.7700258397932817,
+                    "support": 173.0
+                }
+            }
+        },
+        "mBERT": {
+            "test_accuracy": 76.53333333333333,
+            "avg_precision": 0.7711061102018549,
+            "avg_recall": 0.6763361493997332,
+            "avg_f1": 0.699825091252967,
+            "per_class_metrics": {
+                "neg": {
+                    "precision": 0.7692307692307693,
+                    "recall": 0.38461538461538464,
+                    "f1-score": 0.5128205128205128,
+                    "support": 52.0
+                },
+                "neu": {
+                    "precision": 0.8085106382978723,
+                    "recall": 0.76,
+                    "f1-score": 0.7835051546391754,
+                    "support": 150.0
+                },
+                "pos": {
+                    "precision": 0.7355769230769231,
+                    "recall": 0.884393063583815,
+                    "f1-score": 0.8031496062992126,
+                    "support": 173.0
+                }
+            }
+        },
+        "XLM-RoBERTa": {
+            "test_accuracy": 81.33333333333333,
+            "avg_precision": 0.8151709401709403,
+            "avg_recall": 0.7698423990909541,
+            "avg_f1": 0.7866802163819814,
+            "per_class_metrics": {
+                "neg": {
+                    "precision": 0.8205128205128205,
+                    "recall": 0.6153846153846154,
+                    "f1-score": 0.7032967032967034,
+                    "support": 52.0
+                },
+                "neu": {
+                    "precision": 0.7797619047619048,
+                    "recall": 0.8733333333333333,
+                    "f1-score": 0.8238993710691823,
+                    "support": 150.0
+                },
+                "pos": {
+                    "precision": 0.8452380952380952,
+                    "recall": 0.8208092485549133,
+                    "f1-score": 0.8328445747800586,
+                    "support": 173.0
+                }
+            }
+        },
+        "Custom-BERT-Attention": {
+            "test_accuracy": 74.93333333333334,
+            "avg_precision": 0.7866521381595728,
+            "avg_recall": 0.6839429870065707,
+            "avg_f1": 0.7130132766136565,
+            "per_class_metrics": {
+                "neg": {
+                    "precision": 0.8620689655172413,
+                    "recall": 0.4807692307692308,
+                    "f1-score": 0.6172839506172839,
+                    "support": 52.0
+                },
+                "neu": {
+                    "precision": 0.7862595419847328,
+                    "recall": 0.6866666666666666,
+                    "f1-score": 0.7330960854092526,
+                    "support": 150.0
+                },
+                "pos": {
+                    "precision": 0.7116279069767442,
+                    "recall": 0.884393063583815,
+                    "f1-score": 0.7886597938144329,
+                    "support": 173.0
+                }
+            }
+        }
+    }
+}

model.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:98d8bfa806feff9ff73b70df4b0a5a474f6c63f799d389fdcbf9d7fb782d481e
+size 1112250694

per_class_metrics.png ADDED Viewed

test_accuracy_comparison.png ADDED Viewed

training_curves.png ADDED Viewed