Compare AI responses and gate regressions
Analyze text for potential jailbreak risks
Analyze user intent and assess model responses
Evaluate text for toxicity and fairness