Keyang Xuan PRO
keyangx3
AI & ML interests
None yet
Recent Activity
upvoted a paper about 23 hours ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
upvoted a paper 6 months ago
Sotopia-RL: Reward Design for Social Intelligence
Organizations
None yet