Guanxing Lu

GuanxingLu · https://guanxinglu.github.io/

AI & ML interests

Computer Vision, Reinforcement Learning, etc.

Recent Activity

upvoted a paper about 6 hours ago
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability
liked a Space 17 days ago
WorldArena/WorldArena
updated a model about 1 month ago
GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss

Organizations

None yet

models 14

GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss

8B • Updated May 6 • 4

GuanxingLu/momo-dpo-reverse-deepseek-r1-7b-anneal

8B • Updated May 4 • 4

GuanxingLu/momo-dpo-deepseek-r1-7b-abla-qwen3-1.7b

8B • Updated May 4 • 2

GuanxingLu/paper-momo-efficient-rloo-anneal-qwen25-math7b

8B • Updated May 4 • 4

GuanxingLu/paper-momo-thinkprune-qwen25-math7b

8B • Updated May 4 • 3

GuanxingLu/paper-momo-dapo-overlong-qwen25-math7b

8B • Updated May 4 • 3

GuanxingLu/momo-efficient-rloo-deepseek-r1-7b

8B • Updated May 3 • 4

GuanxingLu/paper-momo-efficient-rloo-qwen25-math7b

8B • Updated May 3 • 2

GuanxingLu/paper-momo-grpo-reverse-dpo-qwen25-math7b

8B • Updated May 3 • 3

GuanxingLu/paper-momo-grpo-qwen25-math7b

8B • Updated May 3 • 5

datasets 0

None public yet