Yilun Zhao PRO

yilunzhao

·

AI & ML interests

None yet

Recent Activity

published a dataset 1 day ago

allenai/sage-retrieval

updated a dataset 1 day ago

allenai/sage-retrieval

upvoted a paper 3 days ago

Dockerless: Environment-Free Program Verifier for Coding Agents

View all activity

Organizations

upvoted 2 papers 3 days ago

Dockerless: Environment-Free Program Verifier for Coding Agents

Paper • 2606.28436 • Published 8 days ago • 102

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Paper • 2606.32032 • Published 4 days ago • 21

upvoted a paper 8 days ago

GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents

Paper • 2606.24551 • Published 12 days ago • 28

upvoted a paper 10 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 11 days ago • 144

upvoted a paper 16 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 19 days ago • 121

upvoted 2 papers 18 days ago

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

Paper • 2606.13662 • Published 23 days ago • 28

Benchmarking AI Agents for Addressing Scientific Challenges Across Scales

Paper • 2606.12736 • Published 24 days ago • 5

upvoted a paper 29 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published Jun 3 • 39

upvoted 3 papers about 1 month ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Paper • 2605.26114 • Published May 25 • 65

Your Embedding Model is SMARTer Than You Think

Paper • 2605.24938 • Published May 24 • 25

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published May 25 • 34

upvoted 3 papers about 2 months ago

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

Paper • 2605.19769 • Published May 19 • 85

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 116

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Paper • 2605.04018 • Published May 5 • 41

upvoted 4 papers 2 months ago

Step-level Optimization for Efficient Computer-use Agents

Paper • 2604.27151 • Published Apr 29 • 19

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

Paper • 2604.24198 • Published Apr 27 • 23

TexOCR: Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction

Paper • 2604.22880 • Published Apr 24 • 10

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

upvoted 2 papers 3 months ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published Apr 6 • 36

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published Apr 8 • 73