---
language:
- ur
license: apache-2.0
tags:
- text-classification
- sentiment-analysis
- urdu
- bert
- fine-tuned
- nlp
datasets:
- mirfan899/imdb_urdu_reviews
metrics:
- accuracy
- f1
model-index:
- name: urdu-sentiment-classifier
  results:
  - task:
      type: text-classification
    dataset:
      name: IMDB Urdu Reviews
      type: mirfan899/imdb_urdu_reviews
    metrics:
    - type: accuracy
      value: 0.81
    - type: f1
      value: 0.8098
---
# Urdu Sentiment Classifier 🇵🇰
A fine-tuned **bert-base-multilingual-cased** model for **Urdu sentiment analysis** that classifies Urdu text as positive or negative.
## Live Demo
[Try it on HuggingFace Spaces](https://huggingface.co/spaces/H-Layba/urdu-sentiment-classifier)
## Performance
| Metric | Score |
|--------|-------|
| Accuracy | 81.00% |
| F1 Score (weighted) | 0.8098 |
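
For readers unfamiliar with the metric, weighted F1 averages the per-class F1 scores, weighting each class by its number of true examples. The snippet below is a minimal pure-Python sketch on made-up toy labels. It is not the evaluation script used for this model, and the function names `accuracy` and `weighted_f1` are hypothetical.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def weighted_f1(y_true, y_pred):
    """Per-class F1, averaged with weights equal to each class's support."""
    support = Counter(y_true)  # number of true examples per class
    total = 0.0
    for label, n in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        total += n * f1
    return total / len(y_true)


# Toy labels, invented for illustration only (not model output).
y_true = ["positive", "positive", "negative", "negative", "negative"]
y_pred = ["positive", "negative", "negative", "negative", "positive"]

print("accuracy:", accuracy(y_true, y_pred))
print("weighted F1:", weighted_f1(y_true, y_pred))
```

In practice a library routine such as scikit-learn's `f1_score(..., average="weighted")` would likely be used instead; the hand-rolled version is only meant to show what the number measures.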
## Example Predictions
```python
from transformers import pipeline

# Load the fine-tuned classifier from the Hugging Face Hub.
classifier = pipeline("text-classification", model="H-Layba/urdu-sentiment-classifier")

# "This film was very good"
classifier("یہ فلم بہت اچھی تھی")
# [{'label': 'positive', 'score': 0.9936}]

# "Today was a very bad day"
classifier("آج کا دن بہت برا تھا")
# [{'label': 'negative', 'score': 0.9918}]
```
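
The pipeline returns a list with one `{'label', 'score'}` dictionary per input, as in the outputs shown above. A possible way to post-process that result is sketched below. The helper name `decide` and the 0.9 threshold are hypothetical choices for illustration, not part of this model or of the `transformers` API.

```python
def decide(result, threshold=0.9):
    """Return the predicted label, or "uncertain" when the score is low.

    `result` is assumed to have the shape shown in the examples above:
    a list containing one dict with 'label' and 'score' keys.
    The threshold value is an arbitrary illustration, not a tuned setting.
    """
    top = result[0]
    if top["score"] >= threshold:
        return top["label"]
    return "uncertain"


# Using the output from the first example above.
print(decide([{"label": "positive", "score": 0.9936}]))
```

A threshold like this is only useful if low-confidence predictions should be routed elsewhere (for example, to manual review); a suitable value would need to be chosen on held-out data.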
## Training Details
- **Base model:** bert-base-multilingual-cased
- **Dataset:** 50,000 Urdu movie reviews
- **Epochs:** 5
- **Learning rate:** 2e-5
- **Batch size:** 32 (train), 64 (eval)
- **Hardware:** Kaggle T4 GPU
- **Mixed precision:** fp16
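
As a rough sketch, the hyperparameters listed above could be expressed as keyword arguments for `transformers.TrainingArguments`. This assumes the Hugging Face `Trainer` was used and that training ran on a single GPU (so the per-device batch size equals the listed batch size); neither is confirmed here. The training-set size in the example is a hypothetical placeholder, since the train/eval split is not stated.

```python
import math

# Hyperparameters from the list above, keyed by TrainingArguments parameter names.
# Assumption: single GPU, so per-device batch size == listed batch size.
training_kwargs = {
    "num_train_epochs": 5,
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 32,
    "per_device_eval_batch_size": 64,
    "fp16": True,  # mixed precision
}


def steps_per_epoch(num_examples, batch_size):
    """Optimizer steps per epoch, assuming no gradient accumulation."""
    return math.ceil(num_examples / batch_size)


# Hypothetical training-set size; the actual split is not documented here.
hypothetical_train_size = 40_000
steps = steps_per_epoch(
    hypothetical_train_size, training_kwargs["per_device_train_batch_size"]
)
print("steps per epoch:", steps)
print("total steps:", steps * training_kwargs["num_train_epochs"])

# With transformers installed, the dict could be passed along like this
# ("out" is a placeholder output directory):
# from transformers import TrainingArguments
# args = TrainingArguments(output_dir="out", **training_kwargs)
```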
## Dataset
Trained on `mirfan899/imdb_urdu_reviews`, a set of 50,000 Urdu translations of IMDB movie reviews, each labeled with positive or negative sentiment.
## Part of Urdu NLP Suite
This model is part of a larger collection of fine-tuned Urdu NLP models:
- [x] Sentiment Classification ← this model
- [ ] Text Summarization
- [ ] Question Answering
- [ ] Urdu → English Translation