Guanxing Lu

GuanxingLu

https://guanxinglu.github.io/

AI & ML interests

Computer Vision, Reinforcement Learning, etc.

Recent Activity

upvoted a paper about 6 hours ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

liked a Space 17 days ago

WorldArena/WorldArena

updated a model about 1 month ago

GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss

Organizations

None yet

Models 14

GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss

8B • Updated May 6 • 4

GuanxingLu/momo-dpo-reverse-deepseek-r1-7b-anneal

8B • Updated May 4 • 4

GuanxingLu/momo-dpo-deepseek-r1-7b-abla-qwen3-1.7b

8B • Updated May 4 • 2

GuanxingLu/paper-momo-efficient-rloo-anneal-qwen25-math7b

8B • Updated May 4 • 4

GuanxingLu/paper-momo-thinkprune-qwen25-math7b

8B • Updated May 4 • 3

GuanxingLu/paper-momo-dapo-overlong-qwen25-math7b

8B • Updated May 4 • 3

GuanxingLu/momo-efficient-rloo-deepseek-r1-7b

8B • Updated May 3 • 4

GuanxingLu/paper-momo-efficient-rloo-qwen25-math7b

8B • Updated May 3 • 2

GuanxingLu/paper-momo-grpo-reverse-dpo-qwen25-math7b

8B • Updated May 3 • 3

GuanxingLu/paper-momo-grpo-qwen25-math7b

8B • Updated May 3 • 5

Datasets 0

None public yet