Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
authored a paper about 2 months ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models authored a paper about 2 months ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning