Yao

distant-yuan

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

upvoted a paper about 1 month ago

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

upvoted a paper about 1 month ago

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Organizations

None yet

upvoted 3 papers about 1 month ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published May 27 • 32

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

Paper • 2605.27209 • Published May 26 • 16

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Paper • 2605.27141 • Published May 26 • 20

upvoted 2 papers about 2 months ago

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Paper • 2605.22177 • Published May 21 • 21

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 116

New activity in ChilleD/WebHarbor about 2 months ago

feat(phys_org): add Phys.org mirror tarball

#7 opened about 2 months ago by

distant-yuan

feat(phys_org): add Phys.org mirror tarball

#6 opened about 2 months ago by

distant-yuan

updated a dataset about 2 months ago

distant-yuan/WebHarbor

Updated May 13 • 26

upvoted a paper about 2 months ago

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Paper • 2605.10923 • Published May 11 • 13

upvoted 3 papers 3 months ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published Apr 9 • 48

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 103

V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

Paper • 2603.10848 • Published Mar 11 • 16

upvoted a paper 5 months ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

authored a paper 5 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

upvoted 6 papers 5 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Paper • 2601.21468 • Published Jan 29 • 25

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 183
