Shenzhi Yang's picture

Shenzhi Yang

Shenzhi

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 4 hours ago

WTF GENIUS PAPERS

upvoted a paper 1 day ago

OPRD: On-Policy Representation Distillation

submitted a paper 1 day ago

OPRD: On-Policy Representation Distillation

View all activity

Organizations

None yet

upvoted a collection about 4 hours ago

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 167 items • Updated about 13 hours ago • 33

upvoted a paper 1 day ago

OPRD: On-Policy Representation Distillation

Paper • 2606.06021 • Published 2 days ago • 6

upvoted an article about 1 month ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 121

upvoted a paper about 2 months ago

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Paper • 2604.03993 • Published Apr 5 • 43

upvoted a paper 5 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

upvoted a paper 6 months ago

TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning

Paper • 2512.13106 • Published Dec 15, 2025 • 4