Pelly's picture

3

Pelly

Pellypp

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

upvoted a paper about 9 hours ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

upvoted a paper about 2 months ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

View all activity

Organizations

None yet

upvoted 2 papers about 9 hours ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Paper • 2606.04923 • Published 16 days ago • 39

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 2 days ago • 8

upvoted a paper about 2 months ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Paper • 2604.24625 • Published Apr 27 • 26