Yuxuan YAO

yyuxuan

AI & ML interests

None yet

Recent Activity

liked a dataset 8 months ago

open-r1/OpenR1-Math-220k

Organizations

None yet

upvoted a paper about 2 months ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

upvoted a paper 6 months ago

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 52

upvoted a paper 8 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 99

upvoted a paper 10 months ago

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Paper • 2508.13755 • Published Aug 19, 2025 • 14

upvoted a paper 11 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119

upvoted a paper about 1 year ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 114

upvoted a paper over 1 year ago

FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Paper • 2502.20238 • Published Feb 27, 2025 • 23