jiangyuhao

JYuhao88

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

SciCode/SciCode-Programming-Problems

Organizations

None yet

upvoted a paper about 13 hours ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Paper • 2606.09076 • Published 4 days ago • 47

upvoted a paper 7 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 53

upvoted a paper 8 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76

upvoted a paper 9 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted an article over 1 year ago

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Article • yuchenlin • Jul 27, 2024 • 35