arxiv:2606.04923
Hao Zhuoyuan 郝卓远
larry2210
AI & ML interests
None yet
Recent Activity
authored a paper about 3 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning upvoted a paper about 8 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning submitted a paper about 9 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement LearningOrganizations
None yet