|
|
|
|
|
--- |
|
|
language: fa |
|
|
library_name: transformers |
|
|
tags: |
|
|
- classification |
|
|
- legal |
|
|
- iranian-legal |
|
|
- persian |
|
|
- case-type |
|
|
pipeline_tag: text-classification |
|
|
--- |
|
|
|
|
|
# QomSSLab/CaseTypeClassifier-fa |
|
|
|
|
|
**QomSSLab/CaseTypeClassifier-fa** is a Persian legal text classifier that predicts whether a court ruling (رأی) belongs to a **civil (حقوقی)** or **criminal (کیفری)** category. |
|
|
The model is designed for use in Iranian legal NLP pipelines, document organization, and downstream analysis of judicial data. |
|
|
|
|
|
## 💡 Use Cases |
|
|
|
|
|
- Automatic classification of Persian court rulings into civil or criminal categories. |
|
|
- Preprocessing step for legal analytics and document retrieval systems. |
|
|
- Assisting legal researchers and developers in structuring Persian legal corpora. |
|
|
|
|
|
## 🧠 Model Details |
|
|
|
|
|
- **Language**: Persian (Farsi) |
|
|
- **Task**: Text Classification |
|
|
- **Classes**: `civil` (حقوقی), `criminal` (کیفری) |
|
|
- **Pipeline Tag**: `text-classification` |
|
|
|
|
|
## 📦 Example Usage |
|
|
|
|
|
```python |
|
|
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline |
|
|
|
|
|
model_name = "QomSSLab/CaseTypeClassifier-fa" |
|
|
tokenizer = AutoTokenizer.from_pretrained(model_name) |
|
|
model = AutoModelForSequenceClassification.from_pretrained(model_name) |
|
|
|
|
|
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer) |
|
|
|
|
|
text = "در این پرونده متهم به سرقت اموال عمومی محکوم شده است." |
|
|
result = classifier(text) |
|
|
|
|
|
print(result) |
|
|
``` |
|
|
|
|
|
Example Output: |
|
|
```python |
|
|
[ |
|
|
{'label': 'کیفری', 'score': 0.9969141483306885} |
|
|
] |
|
|
``` |
|
|
|
|
|
## 📊 Evaluation |
|
|
|
|
|
The model was trained and evaluated on a balanced dataset of Persian court rulings. |
|
|
It demonstrates high accuracy in distinguishing civil and criminal judgments. |
|
|
|
|
|
| Metric | Value | |
|
|
|:-------|:------:| |
|
|
| **Training Loss** | 0.0358 | |
|
|
| **Validation Loss** | 0.033996 | |
|
|
| **Accuracy** | **0.9951** | |
|
|
| **F1 Score** | **0.9951** | |
|
|
| **Precision** | **0.9951** | |
|
|
| **Recall** | **0.9951** | |
|
|
|
|
|
✅ **Final Performance:** The model achieved **99.51% accuracy** and **0.9951 F1-score** on the validation set. |
|
|
|
|
|
### Limitations |
|
|
|
|
|
- Performance may degrade on highly abbreviated or informal texts. |
|
|
- Designed primarily for Iranian legal language; may not generalize to non-Iranian legal contexts. |
|
|
- Does not classify subtypes (e.g., family, property, or financial cases). |
|
|
|
|
|
|