Yaochen Zhu

yaochenzhu

https://www.ychzhu.com

AI & ML interests

Causal Inference, Deep Probabilistic Models, Recommender Systems

Organizations

None yet

Recent Activity

upvoted a paper about 1 month ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published May 20 • 50

upvoted 2 papers 8 months ago

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

Paper • 2510.24940 • Published Oct 28, 2025 • 18

Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning

Paper • 2510.20150 • Published Oct 23, 2025 • 7

commented 2 papers 8 months ago

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

Paper • 2510.24940 • Published Oct 28, 2025 • 18

Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning

Paper • 2510.20150 • Published Oct 23, 2025 • 7

authored 2 papers 8 months ago

Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning

Paper • 2510.20150 • Published Oct 23, 2025 • 7

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

Paper • 2510.24940 • Published Oct 28, 2025 • 18

upvoted a paper 9 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

published a dataset about 1 year ago

yaochenzhu/reddit-v2

Updated Jun 4, 2025 • 8