Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks) b2860e4 Soham Banerjee committed on Apr 5
Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks) d741d4b Soham Banerjee committed on Apr 4
Fill easy GT gaps: full label×action coverage (79/79 checks) a4c538a Soham Banerjee committed on Apr 4
Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks) 748cef6 Soham Banerjee committed on Apr 4
10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks) 941d83d Soham Banerjee committed on Apr 4