Zhihan Liu

ZHLiu627

AI & ML interests

LLMs

Organizations

None yet

upvoted a paper 4 months ago

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Paper • 2601.18217 • Published Jan 26 • 13

upvoted a paper 6 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276

upvoted a paper 7 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

upvoted 2 papers over 1 year ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26, 2025 • 82

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

Paper • 2405.16436 • Published May 26, 2024 • 1