Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
upvoted a paper about 1 month ago
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
upvoted a paper about 1 month ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
upvoted a paper about 1 month ago
SWE-Universe: Scale Real-World Verifiable Environments to Millions