Simulate life scenarios and see outcome metrics
Assess content moderation decisions on benchmark scenarios