content-moderation-env / moderation_benchmark.json

Commit History

Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks)
b2860e4

Soham Banerjee committed on

2b 3a
abf8abc

DayalGupta03 committed on

Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks)
d741d4b

Soham Banerjee committed on

Fill easy GT gaps: full label×action coverage (79/79 checks)
a4c538a

Soham Banerjee committed on

Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks)
748cef6

Soham Banerjee committed on

10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks)
941d83d

Soham Banerjee committed on

Close all 4 scoring gaps (+~6 pts)
9bc46b3

Soham Banerjee committed on

phase 1 and phase 2a
78d0a45

DayalGupta03 committed on

ContentModerationEnv v1.0 — complete OpenEnv benchmark
2a39e79

Soham Banerjee committed on