Horbee/bert-german-offensive-comment-classifier aka SauerBERT
SauerBERT is a fine-tuned German BERT-based transformer model for offensive comment detection. It was trained on a balanced dataset of 8,000 examples from the GermEval 2018 and 2019 shared tasks, fine-tuned for 2 epochs. The model achieves strong performance metrics on German online comments, including:
- Accuracy: 84.3%
- F1 Score: 0.796
- Precision: 0.784
- Recall: 0.808
- AUC: 0.91
SauerBERT is designed to help detect offensive language, and rude comments in German text, making it suitable for moderation systems, research, or content analysis pipelines.
Intended Use:
Detection of offensive, or inappropriate German-language comments
Social media moderation tools
Example Use:
from transformers import pipeline
classifier = pipeline("text-classification",
model="Horbee/bert-german-offensive-comment-classifier")
sequence_to_classify = "Ich kann es nicht ausstehen, mit so einem Idioten im selben Raum zu sein."
result = classifier(sequence_to_classify)
print(result) # [{'label': 'Offensive', 'score': 0.9911119341850281}]
Limitations:
Trained only on GermEval 2018/2019 data — performance on out-of-domain or highly informal texts may vary.
May not capture all forms of subtle toxicity or sarcasm.
Designed for German-language content; not suitable for other languages.
Author comments
Thank you for using my model, let me know if it helped you out. I would appreciate any constructive feedback.
- Downloads last month
- 37
Model tree for Horbee/bert-german-offensive-comment-classifier
Base model
google-bert/bert-base-german-cased