darwinkernelpanic
/

moderat

+---
+language: en
+license: mit
+library_name: sklearn
+tags:
+- content-moderation
+- text-classification
+- safety
+- dual-mode
+---
+# moderat - Dual-Mode Content Moderation
+A text classification model for content moderation with age-appropriate filtering.
+## Usage
+```python
+from inference import DualModeFilter
+filter = DualModeFilter("darwinkernelpanic/moderat")
+result = filter.check("damn that's crazy", age=15)
+# -> ALLOWED (reaction swearing permitted for 13+)
+```
+## Model Details
+- **Algorithm:** Multinomial Naive Bayes with TF-IDF
+- **Test accuracy:** 77%
+- **Classes:** 6 (Safe, Harassment, Swearing-Reaction, Swearing-Aggressive, Hate-Speech, Spam)
+## Age Modes
+| Content | <13 | 13+ |
+|---------|-----|-----|
+| "damn that's crazy" | ❌ Blocked | ✅ Allowed |
+| "you're trash" | ❌ Blocked | ❌ Blocked |
+| "kill yourself" | ❌ Blocked | ❌ Blocked |