
Zhiheng Xi

WooooDyy

AI & ML interests

None yet

Recent Activity

upvoted a collection about 7 hours ago
AgentDoG
authored a paper 7 days ago
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
upvoted a paper 7 days ago
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Organizations

AgentGym, MathCritique, Nex AGI

commented on 3 papers 3 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21 • 3

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21 • 3

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84 • 3
commented on a paper over 1 year ago

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 21 • 5