Guanxing Lu
GuanxingLu
ยท
AI & ML interests
Computer Vision, Reinforcement Learning, etc.
Recent Activity
upvoted a paper about 11 hours ago
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability liked a Space 18 days ago
WorldArena/WorldArena updated a model about 1 month ago
GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-lossOrganizations
None yet