YutaoXie
AndreasX1206
AI & ML interests
None yet
Recent Activity
upvoted a paper 20 days ago
TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs upvoted a paper 20 days ago
IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL updated a model 7 months ago
AndreasX1206/test