Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yikun Ban's picture
5 15 1

Yikun Ban

Yikunb
dark-pen's profile picture MachiaveIIi's profile picture hzxllll's profile picture
·

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper 15 days ago
Recursive Multi-Agent Systems
upvoted a paper about 2 months ago
Contextual Bandits with Online Neural Regression
upvoted a paper 2 months ago
Video-Based Reward Modeling for Computer-Use Agents
View all activity

Organizations

None yet

commented 2 papers 3 months ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 264 •
11

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290 •
6
commented 2 papers 4 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158 •
7

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158 •
7
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs