Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
3
Weizhi Xue
roseblooming
Follow
roseblooming
weizhi-xue-992606367
AI & ML interests
None yet
Organizations
None yet
roseblooming
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
wzx111/Qwen3-1.7B-MATH-GDPO
5 months ago
Which post-training method was actually used for this model, GDPO or GRPO?
1
#1 opened 5 months ago by
roseblooming