Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
4
Zhenghao ZHU
zhenghaozhu
Follow
AI & ML interests
Large Language Model, Agents
Recent Activity
authored
a paper
about 21 hours ago
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data
authored
a paper
about 21 hours ago
InsightEval: An Expert-Curated Benchmark for Assessing Insight Discovery in LLM-Driven Data Agents
updated
a dataset
8 months ago
zhenghaozhu/llm-hkmmlu-leaderboard-results
View all activity
Organizations
None yet
zhenghaozhu
's Spaces
1
Sort:Â Recently updated
pinned
Runtime error
2
LLM HKMMLU Leaderboard
🥇
Explore and submit LLM benchmarks