haoran he

haoranhe

tinnerhrhe

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 11 days ago

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

Paper • 2605.12495 • Published 13 days ago • 35

upvoted a paper about 1 month ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 66

upvoted 2 papers 2 months ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 36

Complementary Reinforcement Learning

Paper • 2603.17621 • Published Mar 18 • 37

upvoted 2 papers 5 months ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

Paper • 2512.24138 • Published Dec 30, 2025 • 30

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published Dec 23, 2025 • 95

upvoted a paper 6 months ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published Dec 2, 2025 • 41

upvoted a paper 8 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29, 2025 • 29

upvoted a paper 12 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23, 2025 • 41

upvoted a paper over 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10, 2025 • 152