File size: 3,316 Bytes
db35401 98b0c3d d617ddb db35401 d617ddb 98b0c3d d617ddb db35401 d617ddb db35401 98b0c3d db35401 98b0c3d db35401 98b0c3d d617ddb 98b0c3d db35401 d617ddb db35401 d617ddb db35401 d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d db35401 d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb 98b0c3d d617ddb |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 |
---
language:
- vi
tags:
- hate-speech-detection
- vietnamese-nlp
- text-classification
- offensive-speech
license: mit
datasets:
- vihsd
base_model: vinai/bartpho-syllable-base
---
# BARTPHO
BARTpho fine-tuned cho bài toán phân loại Hate Speech tiếng Việt.
## Model Details
- **Model type**: Fine-tuned transformer model
- **Architecture**: BARTpho (Bidirectional and Auto-Regressive Transformer cho tiếng Việt)
- **Base model**: [vinai/bartpho-syllable-base](https://huggingface.co/vinai/bartpho-syllable-base)
- **Task**: Hate Speech Classification
- **Language**: Vietnamese
- **Labels**: CLEAN (0), OFFENSIVE (1), HATE (2)
## 📊 Model Performance
| Metric | Score |
|--------|-------|
| Accuracy | 0.8985 |
| F1 Macro | 0.6791 |
| F1 Weighted | 0.8886 |
## Model Description
BARTpho fine-tuned cho bài toán phân loại Hate Speech tiếng Việt. Model này được fine-tune từ `vinai/bartpho-syllable-base` trên dataset ViHSD (Vietnamese Hate Speech Dataset).
## How to Use
### Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
# Load model and tokenizer
model_name = "visolex/hate-speech-bartpho"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
# Classify text
text = "Văn bản tiếng Việt cần phân loại"
inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True)
with torch.no_grad():
outputs = model(**inputs)
predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
predicted_label = torch.argmax(predictions, dim=-1).item()
# Label mapping
label_names = {
0: "CLEAN",
1: "OFFENSIVE",
2: "HATE"
}
print(f"Predicted label: {label_names[predicted_label]}")
print(f"Confidence scores: {predictions[0].tolist()}")
```
### Using the Pipeline
```python
from transformers import pipeline
classifier = pipeline(
"text-classification",
model="visolex/hate-speech-bartpho",
tokenizer="visolex/hate-speech-bartpho"
)
result = classifier("Văn bản tiếng Việt cần phân loại")
print(result)
```
## Training Details
### Training Data
- Dataset: ViHSD (Vietnamese Hate Speech Dataset)
- Training samples: ~8,000 samples
- Validation samples: ~1,000 samples
- Test samples: ~1,000 samples
### Training Procedure
- Framework: PyTorch + Transformers
- Optimizer: AdamW
- Learning Rate: 2e-5
- Batch Size: 32
- Epochs: Varies by model
- Max Sequence Length: 256
### Label Distribution
- CLEAN (0): Normal content without offensive language
- OFFENSIVE (1): Mildly offensive content
- HATE (2): Hate speech and extremist language
## Evaluation
Model được đánh giá trên test set của ViHSD với các metrics:
- Accuracy: Overall classification accuracy
- F1 Macro: Macro-averaged F1 score across all labels
- F1 Weighted: Weighted F1 score based on label frequency
## Limitations and Bias
- Model chỉ được train trên dữ liệu tiếng Việt từ mạng xã hội
- Performance có thể giảm trên domain khác (email, document, etc.)
- Model có thể có bias từ dữ liệu training
- Cần đánh giá thêm trên dữ liệu real-world
## Citation
## Contact
## License
This model is distributed under the MIT License.
|