Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Weizhi Xue
roseblooming
Follow
WeizhiXue80355
roseblooming
weizhi-xue-992606367
AI & ML interests
None yet
Recent Activity
new
activity
15 days ago
wzx111/Qwen3-1.7B-MATH-GDPO:
Which post-training method was actually used for this model, GDPO or GRPO?
View all activity
Organizations
None yet
roseblooming
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
wzx111/Qwen3-1.7B-MATH-GDPO
15 days ago
Which post-training method was actually used for this model, GDPO or GRPO?
1
#1 opened 15 days ago by
roseblooming