darwinkernelpanic committed on
Commit
7db9699
·
verified ·
1 Parent(s): 1ac7b0c

Upload README.md with huggingface_hub

  1. README.md +38 -0
README.md ADDED
@@ -0,0 +1,38 @@
+ ---
+ language: en
+ license: mit
+ library_name: sklearn
+ tags:
+ - content-moderation
+ - text-classification
+ - safety
+ - dual-mode
+ ---
+
+ # moderat - Dual-Mode Content Moderation
+
+ A text classification model for content moderation with age-appropriate filtering.
+
+ ## Usage
+
+ ```python
+ from inference import DualModeFilter
+
+ filter = DualModeFilter("darwinkernelpanic/moderat")
+ result = filter.check("damn that's crazy", age=15)
+ # -> ALLOWED (reaction swearing permitted for 13+)
+ ```
+
+ ## Model Details
+
+ - **Algorithm:** Multinomial Naive Bayes with TF-IDF
+ - **Test accuracy:** 77%
+ - **Classes:** 6 (Safe, Harassment, Swearing-Reaction, Swearing-Aggressive, Hate-Speech, Spam)
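
The algorithm named above (TF-IDF features fed into Multinomial Naive Bayes) can be sketched with scikit-learn roughly as follows. This is a minimal illustration, not the repository's training code: the toy sentences, the three-label subset, and the names `build_classifier`, `train_texts`, and `train_labels` are assumptions made for the example, and the default hyperparameters may differ from whatever the published model uses.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Hypothetical toy data for illustration only; NOT the model's real training set.
# Only three of the six documented classes are represented here.
train_texts = [
    "have a nice day",
    "see you tomorrow at practice",
    "damn that's crazy",
    "damn what a game",
    "buy cheap followers now",
    "click here for free prizes",
]
train_labels = [
    "Safe",
    "Safe",
    "Swearing-Reaction",
    "Swearing-Reaction",
    "Spam",
    "Spam",
]


def build_classifier() -> Pipeline:
    """Return an unfitted TF-IDF + Multinomial Naive Bayes pipeline."""
    return Pipeline(
        [
            ("tfidf", TfidfVectorizer()),  # text -> sparse TF-IDF vectors
            ("nb", MultinomialNB()),       # Naive Bayes over those vectors
        ]
    )


classifier = build_classifier()
classifier.fit(train_texts, train_labels)

sample = ["damn that was wild"]
predicted_label = classifier.predict(sample)[0]
class_probabilities = classifier.predict_proba(sample)[0]

print(predicted_label)
print(dict(zip(classifier.classes_, class_probabilities.round(3))))
```

Wrapping both steps in a single `Pipeline` keeps the vectorizer's vocabulary and the classifier together, so one fitted object can be saved and reloaded for inference.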
+
+ ## Age Modes
+
+ | Content | <13 | 13+ |
+ |---------|-----|-----|
+ | "damn that's crazy" | ❌ Blocked | ✅ Allowed |
+ | "you're trash" | ❌ Blocked | ❌ Blocked |
+ | "kill yourself" | ❌ Blocked | ❌ Blocked |
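
The age-mode table can be read as a small policy layered on top of the classifier's predicted label. The sketch below is one way such a rule might look, assuming the only difference between the two modes is that `Swearing-Reaction` is allowed at 13 and over, and that every other non-`Safe` class (including `Spam`, which the table does not cover) is blocked in both modes. The function `is_allowed` and the constant names are hypothetical and are not taken from the repository's `inference` module.

```python
# The six class labels listed in the model card.
ALL_LABELS = {
    "Safe",
    "Harassment",
    "Swearing-Reaction",
    "Swearing-Aggressive",
    "Hate-Speech",
    "Spam",
}

# Assumed policy: these sets are an interpretation of the table, not the
# repository's actual rules.
ALWAYS_ALLOWED = {"Safe"}
ALLOWED_FROM_13 = {"Swearing-Reaction"}
TEEN_MODE_MIN_AGE = 13


def is_allowed(label: str, age: int) -> bool:
    """Decide whether content with the given predicted label passes for this age."""
    if label not in ALL_LABELS:
        raise ValueError(f"unknown label: {label!r}")
    if label in ALWAYS_ALLOWED:
        return True
    if label in ALLOWED_FROM_13:
        return age >= TEEN_MODE_MIN_AGE
    # Everything else is blocked in both modes (assumption for Spam).
    return False


print(is_allowed("Swearing-Reaction", age=15))  # True
print(is_allowed("Swearing-Reaction", age=12))  # False
print(is_allowed("Harassment", age=15))         # False
```

Keeping the policy separate from the classifier means the age threshold or the set of permitted classes can change without retraining the model.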