Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Chen Wang's picture

Chen Wang

wc597358816
  • 597358816

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
authored a paper 5 days ago
Distribution-Centric Policy Optimization Dominates Exploration-Exploitation Trade-off
updated a model 5 days ago
wc597358816/DCPO_Qwen3-4B
View all activity

Organizations

None yet

authored 2 papers 5 days ago

Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning

Paper • 2510.08141 • Published Oct 9, 2025 • 1

Distribution-Centric Policy Optimization Dominates Exploration-Exploitation Trade-off

Paper • 2601.12730 • Published 9 days ago
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs