arxiv:2512.02834
haoran he
haoranhe
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation upvoted a paper 7 days ago
Complementary Reinforcement Learning upvoted a paper 3 months ago
GARDO: Reinforcing Diffusion Models without Reward HackingOrganizations
None yet