Henry Hengyuan Zhao PRO

hhenryz

https://zhaohengyuan1.github.io/

AI & ML interests

Multimodal Reasoning, Human-AI Interaction, GUI Automation

Recent Activity

upvoted a paper 8 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published 17 days ago • 126

upvoted a paper 17 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 29 days ago • 95

New activity in anonymousABC/WorldGUI-Bench 3 months ago

Update worldgui_metadata.json

#2 opened 3 months ago by

upvoted a collection 4 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.69k

New activity in hhenryz/WorldGUI-Bench 4 months ago

Add task categories and link to source code

#1 opened 4 months ago by

upvoted a paper 7 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 54

upvoted a paper 8 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 107

liked a dataset 8 months ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 19.9k • 243

upvoted 2 papers 8 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17, 2025 • 45

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 104

liked a dataset 8 months ago

CSU-JPG/Chart2Code_old

Updated Jan 21 • 211 • 5

updated a collection 8 months ago

Personal Interest

5 items • Updated Oct 23, 2025

upvoted a paper 8 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20, 2025 • 8

commented a paper 8 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20, 2025 • 8

upvoted a collection 8 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 747

upvoted 3 papers 9 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 120

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

BaseReward: A Strong Baseline for Multimodal Reward Model

Paper • 2509.16127 • Published Sep 19, 2025 • 21

liked a dataset 11 months ago

lmms-lab/TempCompass

Viewer • Updated Jun 10, 2024 • 7.54k • 4.08k • 6

upvoted a collection 11 months ago

NVILA

NVILA: Efficient Frontier Visual Language Models • 12 items • Updated Mar 10 • 18