Yifeng Liu

lyf07

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

lyf07/LLaMAX3-8B-Alpaca-WALAR: Add library_name and pipeline_tag metadata

new activity about 2 months ago

lyf07/Qwen3-8B-WALAR: Add library_name and pipeline_tag to metadata

new activity about 2 months ago

lyf07/Translategemma-4B-it-WALAR: Add pipeline tag and library metadata

Organizations

None yet

submitted a paper to Daily Papers about 2 months ago

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Paper • 2603.13045 • Published Mar 13 • 2

authored 2 papers about 2 months ago

R-PRM: Reasoning-Driven Process Reward Modeling

Paper • 2503.21295 • Published Mar 27, 2025

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Paper • 2603.13045 • Published Mar 13 • 2