zzh068

tencent



Recent Activity


authored 4 papers 1 day ago

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Paper • 2305.08322 • Published May 15, 2023

AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents

Paper • 2401.13178 • Published Jan 24, 2024

When Search Agents Should Ask: DiscoBench for Clarification-Aware Deep Search

Paper • 2606.27669 • Published 5 days ago

PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation

Paper • 2508.05976 • Published Aug 8, 2025

upvoted a paper 5 months ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

upvoted a paper about 2 years ago

INDUS: Effective and Efficient Language Models for Scientific Applications

Paper • 2405.10725 • Published May 17, 2024 • 35

liked a dataset about 3 years ago

ceval/ceval-exam

Viewer • Updated Jul 27, 2025 • 13.9k • 38.2k • 302