Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
27
Guanxing Lu
GuanxingLu
Follow
0 followers
·
2 following
https://guanxinglu.github.io/
GuanxingLu
AI & ML interests
Computer Vision, Reinforcement Learning, etc.
Recent Activity
upvoted
a
paper
about 6 hours ago
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability
liked
a Space
17 days ago
WorldArena/WorldArena
updated
a model
about 1 month ago
GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss
View all activity
Organizations
None yet
models
14
Sort: Recently updated
GuanxingLu/momo-dapo-overlong-deepseek-r1-no-dpo-loss
8B
•
Updated
May 6
•
4
GuanxingLu/momo-dpo-reverse-deepseek-r1-7b-anneal
8B
•
Updated
May 4
•
4
GuanxingLu/momo-dpo-deepseek-r1-7b-abla-qwen3-1.7b
8B
•
Updated
May 4
•
2
GuanxingLu/paper-momo-efficient-rloo-anneal-qwen25-math7b
8B
•
Updated
May 4
•
4
GuanxingLu/paper-momo-thinkprune-qwen25-math7b
8B
•
Updated
May 4
•
3
GuanxingLu/paper-momo-dapo-overlong-qwen25-math7b
8B
•
Updated
May 4
•
3
GuanxingLu/momo-efficient-rloo-deepseek-r1-7b
8B
•
Updated
May 3
•
4
GuanxingLu/paper-momo-efficient-rloo-qwen25-math7b
8B
•
Updated
May 3
•
2
GuanxingLu/paper-momo-grpo-reverse-dpo-qwen25-math7b
8B
•
Updated
May 3
•
3
GuanxingLu/paper-momo-grpo-qwen25-math7b
8B
•
Updated
May 3
•
5
View 14 models
datasets
0
None public yet