Situated Refusals: Arena Data Collection Chatbot Arena (55k+140k) refusal analysis: WildGuard detection, OpenAI/NBER context labels, Beebe strategy classification. • 4 items • Updated 5 days ago
Situated Refusals: Arena Data Collection Chatbot Arena (55k+140k) refusal analysis: WildGuard detection, OpenAI/NBER context labels, Beebe strategy classification. • 4 items • Updated 5 days ago