Hugging Face
Xiangxin Zhou
zhouxiangxin
6 followers · 12 following
https://zhouxiangxin1998.github.io/
AI & ML interests
None yet
Recent Activity
Authored a paper, 14 days ago: Rethinking the Divergence Regularization in LLM RL
Authored a paper, 14 days ago: Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
Authored a paper, 14 days ago: Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning
zhouxiangxin's datasets (24), sorted by most recently updated
zhouxiangxin/TACO_subset • Updated Sep 28, 2025 • 4.24k • 5
zhouxiangxin/apps • Updated Sep 28, 2025 • 5k • 8
zhouxiangxin/numina_all_subsets_formatted • Updated Sep 28, 2025 • 39k • 7
zhouxiangxin/Variational-Posterior-4B-Acc-mix • Updated Sep 28, 2025 • 33.4k • 19 • 1
zhouxiangxin/Variational-Posterior-4B-GML-mix • Updated Sep 28, 2025 • 33.4k • 20
zhouxiangxin/Variational-Posterior-8B-Acc-mix • Updated Sep 28, 2025 • 33.4k • 13
zhouxiangxin/Variational-Posterior-8B-GML-mix • Updated Sep 28, 2025 • 33.4k • 36
zhouxiangxin/Variational-Posterior-32B-Acc-mix • Updated Sep 28, 2025 • 33.4k • 8
zhouxiangxin/Variational-Posterior-32B-GML-mix • Updated Sep 28, 2025 • 33.4k • 41
zhouxiangxin/Variational-Posterior-PB-7B-Acc-mix • Updated Sep 28, 2025 • 33.4k • 43
zhouxiangxin/Variational-Posterior-PB-7B-GML-mix • Updated Sep 28, 2025 • 33.4k • 3
zhouxiangxin/Variational-Posterior-PA-7B-Acc-mix • Updated Sep 28, 2025 • 33.4k • 8
zhouxiangxin/Variational-Posterior-PA-7B-GML-mix • Updated Sep 28, 2025 • 33.4k • 6
zhouxiangxin/Bespoke-Stratos-17k-Reasoning • Updated Sep 28, 2025 • 16.7k • 8
zhouxiangxin/Bespoke-Stratos-17k-Source • Updated Sep 28, 2025 • 16.7k • 4
zhouxiangxin/AIME2025 • Updated Sep 28, 2025 • 30 • 3
zhouxiangxin/Bespoke-Stratos-17k-Reformatted • Updated Sep 28, 2025 • 16.7k • 3
zhouxiangxin/Bespoke-Stratos-17k-Train-Posterior-PA • Updated Sep 28, 2025 • 16.7k • 4
zhouxiangxin/Bespoke-Stratos-17k-Train-Posterior-PB • Updated Sep 28, 2025 • 16.7k • 4
zhouxiangxin/generalthought_306k • Updated Apr 24, 2025 • 306k • 15
zhouxiangxin/webinstruct_232k • Updated Apr 18, 2025 • 232k • 14
zhouxiangxin/gsm8k_r1 • Updated Mar 3, 2025 • 8.78k • 48
zhouxiangxin/DualDiff • Updated Dec 16, 2024 • 10 • 2
zhouxiangxin/metformin • Updated May 2, 2024 • 2