Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yihong Wu's picture
1 3

Yihong Wu

Yihong7788
ericray007's profile picture lihengma's profile picture
·

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 4 months ago

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published Oct 1, 2025 • 32

On Predictability of Reinforcement Learning Dynamics for Large Language Models

Paper • 2510.00553 • Published Oct 1, 2025 • 9
upvoted a paper 8 months ago

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

Paper • 2505.20046 • Published May 26, 2025 • 18
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs