Buckets:

seffyehl
/

BetterShield-bucket

3.53 GB

8 files

Updated about 1 month ago

Ctrl+K

Name	Size	Uploaded	Xet hash
.gitattributes	1.52 kB xet	about 1 month ago	818ba6de
AlteredModel.pth	886 MB xet	about 1 month ago	76e7dde2
AlteredShield.pth	882 MB xet	about 1 month ago	cb2893c6
BaseShield.pth	879 MB xet	about 1 month ago	a7c10b14
BaselineShield.pth	879 MB xet	about 1 month ago	e0832480
README.md	1.59 kB xet	about 1 month ago	84f7ce52
alter_config.json	856 Bytes xet	about 1 month ago	13d1f713
base_config.json	741 Bytes xet	about 1 month ago	914393a8

README.md

language: en license: mit tags: - hate-speech-detection - text-classification - rationale-extraction datasets: - your-dataset-name metrics: - accuracy - f1 model-index: - name: hate-speech-rationale-model results: - task: type: text-classification name: Hate Speech Detection metrics: - type: accuracy value: 0.XX # Your test accuracy - type: f1 value: 0.XX # Your test F1

Hate Speech Detection with Rationale Extraction

This model detects hate speech in text and provides rationale explanations for its predictions.

Model Description

Architecture: HateBERT + Rationale BERT + Multi-Scale CNN + Attention
Training Data: [Describe your dataset]
Performance:
- Validation Loss: 0.27
- Test Accuracy: XX%
- Test F1: XX%

Usage

from huggingface_hub import hf_hub_download
import torch

# Download model
model_path = hf_hub_download(
    repo_id="seffyehl/BetterShield",
    filename="pytorch_model.pth"
)

# Load model (see full example in repository)
checkpoint = torch.load(model_path, map_location='cpu')
# ... (rest of loading code)

Training Details

Batch Size: 8
Learning Rate: 1e-5
Weight Decay: 0.05
Dropout: 0.5
Epochs: Stopped early at epoch X

Limitations

[Describe any known limitations]

Citation

@misc{bettershield-2025,
  author = Orion,
  title = Hate Speech Detection with Rationale Extraction,
  year = 2025,
  publisher = HuggingFace,
  url = https://huggingface.co/seffyehl/BetterShield
}

Total size: 3.53 GB

Files: 8

Last updated: May 25

Pre-warmed CDN: US EU US EU

Hate Speech Detection with Rationale Extraction

Model Description

Usage

Training Details

Limitations

Citation

Contributors