ZH
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
authored a paper 1 day ago
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
authored a paper 1 day ago
When Search Agents Should Ask: DiscoBench for Clarification-Aware Deep Search