Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Xiangxin Zhou's picture

Xiangxin Zhou

zhouxiangxin
3 21 4
Gargaz's profile picture MeowFET's profile picture JohnRoger's profile picture
·
https://zhouxiangxin1998.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 21 days ago
Rethinking the Divergence Regularization in LLM RL
authored a paper 21 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
authored a paper 21 days ago
Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning
View all activity

Organizations

ezetimibe's profile picture benzeneRing's profile picture ProteinBench's profile picture Axon RL's profile picture AIFirstScience's profile picture substance0723's profile picture cruise0724's profile picture substance0724's profile picture bioterminal's profile picture Tencent-Hunyuan-Multimodal-RL's profile picture harnessRL's profile picture

commented 2 papers 22 days ago

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 24 days ago • 33 •
4

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 24 days ago • 33 •
4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs