Jieyu Zhao

jieyuz

https://jieyuzhao.github.io

AI & ML interests

LLM, Agents, RL, Alignment

Recent Activity

updated a Space about 2 months ago

shuhengc/MED-COPILOT

upvoted 2 papers 3 months ago

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Paper • 2604.10577 • Published Apr 12 • 27

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published Apr 9 • 23

upvoted a paper 4 months ago

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published Mar 10 • 43

upvoted a paper 5 months ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 76

upvoted a paper 11 months ago

CoAct-1: Computer-using Agents with Coding as Actions

Paper • 2508.03923 • Published Aug 5, 2025 • 13

upvoted 3 papers about 1 year ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13, 2025 • 53

The Hallucination Tax of Reinforcement Finetuning

Paper • 2505.13988 • Published May 20, 2025 • 8

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Paper • 2504.05520 • Published Apr 7, 2025 • 11