Hao Zhuoyuan 郝卓远's picture

Hao Zhuoyuan 郝卓远

larry2210

·

https://github.com/hhh2210

hzy2210

AI & ML interests

None yet

Recent Activity

authored a paper about 4 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

upvoted a paper about 9 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

submitted a paper about 10 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper about 9 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Paper • 2606.04923 • Published 1 day ago • 33

upvoted a paper 4 months ago

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

Paper • 2602.06600 • Published Feb 6 • 3