|
|
--- |
|
|
license: mit |
|
|
language: |
|
|
- de |
|
|
metrics: |
|
|
- name: f1 |
|
|
value: 0.79 |
|
|
- name: auc |
|
|
value: 0.91 |
|
|
- name: accuracy |
|
|
value: 0.84 |
|
|
base_model: |
|
|
- google-bert/bert-base-german-cased |
|
|
pipeline_tag: text-classification |
|
|
--- |
|
|
# Horbee/bert-german-offensive-comment-classifier aka SauerBERT |
|
|
|
|
|
SauerBERT is a fine-tuned German BERT-based transformer model for offensive comment detection. |
|
|
It was trained on a balanced dataset of 8,000 examples from the GermEval 2018 and 2019 shared tasks, fine-tuned for 2 epochs. The model achieves strong performance metrics on German online comments, including: |
|
|
|
|
|
- Accuracy: 84.3% |
|
|
- F1 Score: 0.796 |
|
|
- Precision: 0.784 |
|
|
- Recall: 0.808 |
|
|
- AUC: 0.91 |
|
|
|
|
|
SauerBERT is designed to help detect offensive language, and rude comments in German text, making it suitable for moderation systems, research, or content analysis pipelines. |
|
|
|
|
|
## Intended Use: |
|
|
|
|
|
Detection of offensive, or inappropriate German-language comments |
|
|
|
|
|
Social media moderation tools |
|
|
|
|
|
## Example Use: |
|
|
|
|
|
```python |
|
|
from transformers import pipeline |
|
|
|
|
|
classifier = pipeline("text-classification", |
|
|
model="Horbee/bert-german-offensive-comment-classifier") |
|
|
|
|
|
sequence_to_classify = "Ich kann es nicht ausstehen, mit so einem Idioten im selben Raum zu sein." |
|
|
|
|
|
result = classifier(sequence_to_classify) |
|
|
|
|
|
print(result) # [{'label': 'Offensive', 'score': 0.9911119341850281}] |
|
|
|
|
|
``` |
|
|
|
|
|
## Limitations: |
|
|
|
|
|
Trained only on GermEval 2018/2019 data — performance on out-of-domain or highly informal texts may vary. |
|
|
|
|
|
May not capture all forms of subtle toxicity or sarcasm. |
|
|
|
|
|
Designed for German-language content; not suitable for other languages. |
|
|
|
|
|
## Author comments |
|
|
|
|
|
Thank you for using my model, let me know if it helped you out. I would appreciate any constructive feedback. |
|
|
|