visolex
/

roberta-gru-hsd

@@ -1,139 +1,119 @@
 ---
-language:
-- vi
 tags:
 - hate-speech-detection
-- vietnamese-nlp
 - text-classification
 - offensive-language-detection
-license: mit
 datasets:
-- vihsd
-base_model: vinai/phobert-base-v2
 ---
-# RoBERTa-GRU Hybrid
-Hybrid model kết hợp PhoBERT-V2 và GRU cho bài toán phân loại Hate Speech
 ## Model Details
-### Model Type
-PhoBERT-V2 + Bidirectional GRU
-### Base Model
-This model is fine-tuned from [vinai/phobert-base-v2](https://huggingface.co/vinai/phobert-base-v2)
-### Training Info
-- **Task**: Hate Speech Classification
-- **Language**: Vietnamese
-- **Labels**:
-  - `0`: CLEAN (Normal content)
-  - `1`: OFFENSIVE (Mildly offensive content)
-  - `2`: HATE (Hate speech)
-## 📊 Model Performance
-| Metric | Score |
-|--------|-------|
-| Accuracy | 0.9537 |
-| F1 Macro | 0.8716 |
-| F1 Weighted | 0.9530 |
-## Model Description
-This model has been fine-tuned on the ViHSD (Vietnamese Hate Speech Dataset) to classify Vietnamese text into three categories: CLEAN, OFFENSIVE, and HATE.
-### Architecture
-PhoBERT-V2 + Bidirectional GRU
-The model combines the powerful pretrained representations with task-specific fine-tuning for effective hate speech detection in Vietnamese social media content.
-## How to Use
-### 1. Using Transformers Pipeline
-```python
-from transformers import pipeline
-# Initialize the hate speech classifier
-classifier = pipeline(
-    "text-classification",
-    model="visolex/hate-speech-roberta-gru",
-    tokenizer="visolex/hate-speech-roberta-gru",
-    return_all_scores=True
-)
-# Classify text
-results = classifier("Văn bản tiếng Việt cần kiểm tra")
-print(results)
-```
-### 2. Using AutoModel
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 import torch
 # Load model and tokenizer
-model_name = "visolex/hate-speech-roberta-gru"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForSequenceClassification.from_pretrained(model_name)
-# Prepare text
-text = "Văn bản tiếng Việt cần kiểm tra"
-inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True, max_length=256)
-# Get predictions
 with torch.no_grad():
     outputs = model(**inputs)
-    logits = outputs.logits
-    # Get probabilities
-    probabilities = torch.nn.functional.softmax(logits, dim=-1)
-    # Get predicted label
-    predicted_label = torch.argmax(probabilities, dim=-1).item()
-    confidence = probabilities[0][predicted_label].item()
 # Label mapping
-label_mapping = {
     0: "CLEAN",
     1: "OFFENSIVE",
     2: "HATE"
 }
-print(f"Predicted: {label_mapping[predicted_label]} (Confidence: {confidence:.2%})")
 ```
-### 3. Batch Processing
-```python
-from transformers import AutoTokenizer, AutoModelForSequenceClassification
-import torch
-model_name = "visolex/hate-speech-roberta-gru"
-tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForSequenceClassification.from_pretrained(model_name)
-# List of texts to classify
-texts = [
-    "Bài viết rất hay và bổ ích",
-    "Đồ ngu người ta nói đúng mà",
-    "Cút đi đồ chó"
-]
-# Tokenize and predict
-inputs = tokenizer(texts, return_tensors="pt", padding=True, truncation=True, max_length=256)
-with torch.no_grad():
-    outputs = model(**inputs)
-    predictions = torch.argmax(outputs.logits, dim=-1)
-for text, pred in zip(texts, predictions):
-    label = ["CLEAN", "OFFENSIVE", "HATE"][pred.item()]
-    print(f"{text[:50]} -> {label}")
-```
 ## Training Details
@@ -150,49 +130,10 @@ for text, pred in zip(texts, predictions):
 - **Learning Rate**: 2e-5
 - **Batch Size**: 32
 - **Max Length**: 256 tokens
-- **Epochs**: Optimized via early stopping
-### Preprocessing
-- Text normalization for Vietnamese
-- Special character handling
-- Emoji and slang processing
-## Evaluation Results
-Model evaluation metrics on the ViHSD test set: See Model Performance section above for details.
-### Label Distribution
-- **CLEAN (0)**: Normal content without offensive language
-- **OFFENSIVE (1)**: Mildly offensive or inappropriate content
-- **HATE (2)**: Hate speech, extremist language, severe threats
-## Use Cases
-- **Social Media Moderation**: Automatic detection of hate speech in Vietnamese social media platforms
-- **Content Filtering**: Filtering offensive content in Vietnamese text
-- **Research**: Studying hate speech patterns in Vietnamese online communities
-## Limitations and Considerations
-⚠️ **Important Limitations**:
-- Model trained primarily on social media data, may not generalize to formal text
-- Performance may vary with slang, code-switching, or regional dialects
-- Model reflects biases present in training data
-- Should be used as part of a larger moderation system, not sole decision-maker
-## Citation
-If you use this model in your research, please cite:
-```bibtex
-@software{vihsd_roberta-gru,
-  title = {RoBERTa-GRU Hybrid for Vietnamese Hate Speech Detection},
-  author = {ViSoLex Team},
-  year = {2024},
-  url = {https://huggingface.co/visolex/hate-speech-roberta-gru},
-  base_model = {vinai/phobert-base-v2}
-}
-```
 ## Contact & Support
@@ -206,6 +147,9 @@ This model is distributed under the MIT License.
 ## Acknowledgments
-- Base model trained by vinai
 - Dataset: ViHSD (Vietnamese Hate Speech Detection Dataset)
 - Framework: [Hugging Face Transformers](https://huggingface.co/transformers)

 ---
+license: mit
+base_model: vinai/phobert-base-v2
 tags:
+- vietnamese
 - hate-speech-detection
 - text-classification
 - offensive-language-detection
 datasets:
+- visolex/vihsd
+metrics:
+- accuracy
+- macro-f1
+- weighted-f1
+model-index:
+- name: roberta-gru-hsd
+  results:
+  - task:
+      type: text-classification
+      name: Hate Speech Detection
+    dataset:
+      name: ViHSD
+      type: hate-speech-detection
+    metrics:
+    - type: accuracy
+      value: 0.9537
+    - type: macro-f1
+      value: 0.8716
+    - type: weighted-f1
+      value: 0.9530
+    - type: macro-precision
+      value: 0.8870
+    - type: macro-recall
+      value: 0.8573
 ---
+# RoBERTa-GRU Hybrid: Hate Speech Detection for Vietnamese Text
+This model is a fine-tuned version of [vinai/phobert-base-v2](https://huggingface.co/vinai/phobert-base-v2)
+on the **ViHSD (Vietnamese Hate Speech Detection Dataset)** for classifying Vietnamese text into three categories: CLEAN, OFFENSIVE, and HATE.
 ## Model Details
+* **Base Model**: vinai/phobert-base-v2
+* **Description**: Hybrid model kết hợp PhoBERT-V2 và GRU cho bài toán phân loại Hate Speech
+* **Architecture**: PhoBERT-V2 + Bidirectional GRU
+* **Dataset**: ViHSD (Vietnamese Hate Speech Detection Dataset)
+* **Fine-tuning Framework**: HuggingFace Transformers + PyTorch
+* **Task**: Hate Speech Classification (3 classes)
+### Hyperparameters
+* **Batch size**: `32`
+* **Learning rate**: `2e-5`
+* **Epochs**: `100`
+* **Max sequence length**: `256`
+* **Weight decay**: `0.01`
+* **Warmup steps**: `500`
+* **Early stopping patience**: `5`
+* **Optimizer**: AdamW
+* **Learning rate scheduler**: Cosine with warmup
+## Dataset
+Model was trained on **ViHSD (Vietnamese Hate Speech Detection Dataset)** containing ~10,000 Vietnamese comments from social media.
+### Label Descriptions:
+* **CLEAN (0)**: Normal content without offensive language
+* **OFFENSIVE (1)**: Mildly offensive or inappropriate content
+* **HATE (2)**: Hate speech, extremist language, severe threats
+## Evaluation Results
+The model was evaluated on test set with the following metrics:
+* **Accuracy**: `0.9537`
+* **Macro-F1**: `0.8716`
+* **Weighted-F1**: `0.9530`
+* **Macro-Precision**: `0.8870`
+* **Macro-Recall**: `0.8573`
+### Basic Usage
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 import torch
 # Load model and tokenizer
+model_name = "visolex/roberta-gru-hsd"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(
+    model_name
+)
+# Classify text
+text = "Văn bản tiếng Việt cần phân loại"
+inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True)
 with torch.no_grad():
     outputs = model(**inputs)
+    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
+    predicted_label = torch.argmax(predictions, dim=-1).item()
 # Label mapping
+label_names = {
     0: "CLEAN",
     1: "OFFENSIVE",
     2: "HATE"
 }
+print(f"Predicted label: {label_names[predicted_label]}")
+print(f"Confidence scores: {predictions[0].tolist()}")
 ```
 ## Training Details
 - **Learning Rate**: 2e-5
 - **Batch Size**: 32
 - **Max Length**: 256 tokens
+- **Epochs**: 100 (with early stopping patience: 5)
+- **Weight Decay**: 0.01
+- **Warmup Steps**: 500
 ## Contact & Support
 ## Acknowledgments
+- Base model: [vinai/phobert-base-v2](https://huggingface.co/vinai/phobert-base-v2)
 - Dataset: ViHSD (Vietnamese Hate Speech Detection Dataset)
 - Framework: [Hugging Face Transformers](https://huggingface.co/transformers)
+- ViSoLex Toolkit
+---