Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhu's picture
5

zhu

zhu-thu-22
·

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago
zhu-thu-22/temp
published a dataset about 1 month ago
zhu-thu-22/temp
published a model 2 months ago
zhu-thu-22/GuardReasoner-Omni-4B
View all activity

Organizations

None yet

upvoted 2 papers 3 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 93
upvoted a paper 11 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16, 2025 • 61
upvoted a paper 12 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21, 2025 • 47
upvoted a paper about 1 year ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29, 2025 • 46
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs