Analyze user intent and assess model responses
Compare AI responses and gate regressions
Analyze text for jailbreak risks
Evaluate text for toxicity and fairness