---
license: mit
language:
- de
metrics:
- name: f1
  value: 0.79
- name: auc
  value: 0.91
- name: accuracy
  value: 0.84
base_model:
- google-bert/bert-base-german-cased
pipeline_tag: text-classification
---
# Horbee/bert-german-offensive-comment-classifier aka SauerBERT

SauerBERT is a German BERT model (`google-bert/bert-base-german-cased`) fine-tuned for offensive comment detection.
It was fine-tuned for 2 epochs on a balanced dataset of 8,000 examples from the GermEval 2018 and 2019 shared tasks. On German online comments, the model achieves the following metrics:

- Accuracy: 84.3%
- F1 Score: 0.796
- Precision: 0.784
- Recall: 0.808
- AUC: 0.91
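
As a quick consistency check, the reported F1 score should be the harmonic mean of the reported precision and recall. The following is a minimal sketch using only the numbers listed above; the variable names are illustrative.

```python
# Precision and recall as reported in the metrics list.
precision = 0.784
recall = 0.808

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)

print(round(f1, 3))  # 0.796
```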

SauerBERT is designed to help detect offensive language and rude comments in German text, making it suitable for moderation systems, research, or content analysis pipelines.

## Intended Use:

- Detection of offensive or inappropriate German-language comments
- Social media moderation tools

## Example Use:

```python
from transformers import pipeline

classifier = pipeline("text-classification",
                      model="Horbee/bert-german-offensive-comment-classifier")

sequence_to_classify = "Ich kann es nicht ausstehen, mit so einem Idioten im selben Raum zu sein."

result = classifier(sequence_to_classify)

print(result) # [{'label': 'Offensive', 'score': 0.9911119341850281}]

```
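
In a moderation setting, it may be useful to flag a comment only when the model is reasonably confident. The sketch below post-processes predictions of the shape shown above. The helper `is_offensive` and the threshold of 0.8 are assumptions made for illustration, not part of the model. Only the `Offensive` label is confirmed by the example output, so `Other` is a hypothetical placeholder for the non-offensive label.

```python
def is_offensive(prediction, threshold=0.8):
    """Return True if a single pipeline prediction should be flagged.

    `prediction` is one dict as returned by the text-classification
    pipeline, e.g. {'label': 'Offensive', 'score': 0.99}.
    The default threshold is an illustrative assumption, not a tuned value.
    """
    return prediction["label"] == "Offensive" and prediction["score"] >= threshold


# Stand-in predictions so the sketch runs without downloading the model.
# "Other" is a hypothetical placeholder for the non-offensive label.
predictions = [
    {"label": "Offensive", "score": 0.9911119341850281},
    {"label": "Offensive", "score": 0.55},
    {"label": "Other", "score": 0.97},
]

flags = [is_offensive(p) for p in predictions]
print(flags)  # [True, False, False]
```

With the real model, `predictions` would be the list returned by `classifier(...)`. Any threshold should be chosen on held-out data from the target platform.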

## Limitations:

- Trained only on GermEval 2018/2019 data; performance on out-of-domain or highly informal texts may vary.
- May not capture all forms of subtle toxicity or sarcasm.
- Designed for German-language content; not suitable for other languages.
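
Because performance on out-of-domain text may differ from the reported numbers, it can help to re-measure accuracy, precision, and recall on a small hand-labeled sample from the target domain. The following is a rough sketch: the helper `binary_metrics` is hypothetical, and the gold and predicted flags are made-up toy values, not results from the model.

```python
def binary_metrics(gold, predicted):
    """Compute accuracy, precision, and recall for boolean labels.

    `gold` and `predicted` are equal-length lists of booleans,
    where True means "offensive".
    """
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g and p)          # true positives
    fp = sum(1 for g, p in pairs if not g and p)      # false positives
    fn = sum(1 for g, p in pairs if g and not p)      # false negatives
    tn = sum(1 for g, p in pairs if not g and not p)  # true negatives
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Toy data for illustration only.
gold = [True, True, False, False, True]
predicted = [True, False, False, True, True]

metrics = binary_metrics(gold, predicted)
print(metrics["accuracy"])  # 0.6
```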

## Author comments

Thank you for using my model. Let me know if it helped you out; I would appreciate any constructive feedback.