---
language: 
- vi
tags:
- hate-speech-detection
- vietnamese-nlp
- text-classification
- offensive-speech
license: mit
datasets:
- vihsd
base_model: vinai/bartpho-syllable-base
---

# BARTPHO

BARTpho fine-tuned for Vietnamese hate speech classification.

## Model Details

- **Model type**: Fine-tuned transformer model
- **Architecture**: BARTpho (Bidirectional and Auto-Regressive Transformer for Vietnamese)
- **Base model**: [vinai/bartpho-syllable-base](https://huggingface.co/vinai/bartpho-syllable-base)
- **Task**: Hate Speech Classification
- **Language**: Vietnamese
- **Labels**: CLEAN (0), OFFENSIVE (1), HATE (2)

## Model Performance

| Metric | Score |
|--------|-------|
| Accuracy | 0.8985 |
| F1 Macro | 0.6791 |
| F1 Weighted | 0.8886 |


## Model Description

This model is `vinai/bartpho-syllable-base` fine-tuned on ViHSD (Vietnamese Hate Speech Dataset) to classify Vietnamese text as CLEAN, OFFENSIVE, or HATE.

## How to Use

### Basic Usage

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Load model and tokenizer
model_name = "visolex/hate-speech-bartpho"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Classify text
text = "Văn bản tiếng Việt cần phân loại"
inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True)

with torch.no_grad():
    outputs = model(**inputs)
    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
    predicted_label = torch.argmax(predictions, dim=-1).item()

# Label mapping
label_names = {
    0: "CLEAN",
    1: "OFFENSIVE",
    2: "HATE"
}

print(f"Predicted label: {label_names[predicted_label]}")
print(f"Confidence scores: {predictions[0].tolist()}")
```

### Using the Pipeline

```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="visolex/hate-speech-bartpho",
    tokenizer="visolex/hate-speech-bartpho"
)

result = classifier("Văn bản tiếng Việt cần phân loại")
print(result)
```
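
The pipeline returns a list of dictionaries with `label` and `score` keys. Depending on whether the checkpoint's config defines `id2label`, the label may arrive as a name such as `HATE` or as a generic `LABEL_2`. The helper below is a minimal sketch for normalizing both forms to the label names listed in this card. `normalize_label` is a hypothetical helper, not part of the Transformers API, and the sample result is made up for illustration.

```python
# Label ids and names as listed in this model card.
LABEL_NAMES = {0: "CLEAN", 1: "OFFENSIVE", 2: "HATE"}


def normalize_label(raw_label: str) -> str:
    """Map a pipeline label such as 'LABEL_2' or 'hate' to a card label name."""
    upper = raw_label.strip().upper()
    if upper in LABEL_NAMES.values():
        return upper
    if upper.startswith("LABEL_"):
        # Generic names carry the class id after the underscore.
        return LABEL_NAMES[int(upper.split("_", 1)[1])]
    raise ValueError(f"Unrecognized label: {raw_label!r}")


# Illustrative pipeline-style output (not a real prediction).
sample_result = [{"label": "LABEL_1", "score": 0.87}]
normalized = [
    {"label": normalize_label(r["label"]), "score": r["score"]}
    for r in sample_result
]
print(normalized)
```

In practice, `sample_result` would be replaced by the `result` returned from `classifier(...)` above.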

## Training Details

### Training Data
- Dataset: ViHSD (Vietnamese Hate Speech Dataset)
- Training samples: ~8,000
- Validation samples: ~1,000
- Test samples: ~1,000

### Training Procedure
- Framework: PyTorch + Transformers
- Optimizer: AdamW
- Learning Rate: 2e-5
- Batch Size: 32
- Epochs: Varies by model
- Max Sequence Length: 256
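
As a rough illustration of what these settings imply, the sketch below collects the listed hyperparameters in a small dataclass and estimates the number of optimizer steps per epoch. It assumes about 8,000 training samples (the approximate figure from the Training Data section) and no gradient accumulation. `TrainingConfig` and `steps_per_epoch` are hypothetical names, not part of the original training code.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingConfig:
    # Values taken from the Training Procedure list above.
    optimizer: str = "AdamW"
    learning_rate: float = 2e-5
    batch_size: int = 32
    max_seq_length: int = 256


def steps_per_epoch(num_samples: int, config: TrainingConfig) -> int:
    """Optimizer steps per epoch, assuming no gradient accumulation."""
    return math.ceil(num_samples / config.batch_size)


config = TrainingConfig()
# ~8,000 is the approximate training-set size stated in this card.
print(steps_per_epoch(8_000, config))  # 250
```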

### Label Definitions
- CLEAN (0): Normal content without offensive language
- OFFENSIVE (1): Mildly offensive content
- HATE (2): Hate speech and extremist language

## Evaluation

The model is evaluated on the ViHSD test set using the following metrics:
- Accuracy: Overall classification accuracy
- F1 Macro: Macro-averaged F1 score across all labels
- F1 Weighted: Weighted F1 score based on label frequency
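
To make the difference between the two F1 averages concrete, here is a minimal pure-Python sketch that computes all three metrics from label ids. The toy labels are invented for illustration and are not ViHSD data, and the function names are hypothetical. Macro F1 gives every class equal weight, so weak performance on a rare class lowers it more than it lowers weighted F1. This is one plausible reason for the gap between the F1 Macro and F1 Weighted scores reported in this card.

```python
from collections import Counter


def per_class_f1(y_true, y_pred, label):
    """F1 for one class, computed as 2*TP / (2*TP + FP + FN)."""
    tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
    fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def evaluate(y_true, y_pred, labels=(0, 1, 2)):
    """Return accuracy, macro F1, and support-weighted F1."""
    support = Counter(y_true)
    f1 = {c: per_class_f1(y_true, y_pred, c) for c in labels}
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    f1_macro = sum(f1.values()) / len(labels)
    f1_weighted = sum(f1[c] * support[c] for c in labels) / len(y_true)
    return {"accuracy": accuracy, "f1_macro": f1_macro, "f1_weighted": f1_weighted}


# Toy, imbalanced example: 8 CLEAN, 1 OFFENSIVE, 1 HATE.
# The single OFFENSIVE sample is misclassified as CLEAN.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 2]
y_pred = [0, 0, 0, 0, 0, 0, 0, 0, 0, 2]
scores = evaluate(y_true, y_pred)
print(scores)
```

In this toy case a single missed minority-class sample leaves accuracy at 0.9 while macro F1 falls well below weighted F1.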

## Limitations and Bias

- The model was trained only on Vietnamese social media data.
- Performance may degrade on other domains (email, documents, etc.).
- The model may carry biases from its training data.
- Further evaluation on real-world data is needed.

## Citation


## Contact


## License

This model is distributed under the MIT License.