ZhuoPeng
zhanxing
ยท
AI & ML interests
None yet
Recent Activity
updated a model 8 days ago
zhanxing/CS60003-HW3 published a model 19 days ago
zhanxing/CS60003-HW3 upvoted a paper about 1 month ago
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation