view article Article EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios ServiceNow-AI • 3 days ago • 34
Azimuth: Systematic Error Analysis for Text Classification Paper • 2212.08216 • Published Dec 16, 2022
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents Paper • 2605.13841 • Published 25 days ago • 72
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents Paper • 2605.13841 • Published 25 days ago • 72