Xiangxin Zhou

zhouxiangxin


https://zhouxiangxin1998.github.io/


Recent Activity

authored a paper 21 days ago

Rethinking the Divergence Regularization in LLM RL

authored a paper 21 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

authored a paper 21 days ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning



commented on 2 papers 22 days ago

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 24 days ago • 33
