Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifeng Liu's picture
1 2

Yifeng Liu

lyf07

AI & ML interests

None yet

Recent Activity

submitted a paper 1 day ago
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation
authored a paper 3 days ago
R-PRM: Reasoning-Driven Process Reward Modeling
authored a paper 3 days ago
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation
View all activity

Organizations

None yet

submitted a paper to Daily Papers 1 day ago

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Paper • 2603.13045 • Published 8 days ago • 1
authored 2 papers 3 days ago

R-PRM: Reasoning-Driven Process Reward Modeling

Paper • 2503.21295 • Published Mar 27, 2025

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Paper • 2603.13045 • Published 8 days ago • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs