5

zhu

zhu-thu-22

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
upvoted a paper 1 day ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
updated a dataset 4 days ago
zhu-thu-22/mm_dataset

Organizations

None yet

upvoted 2 papers 1 day ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 4 days ago • 116

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 3 days ago • 67
updated a dataset 4 days ago

zhu-thu-22/mm_dataset

Viewer • Updated 4 days ago • 12.1k • 24
published a dataset 4 days ago

zhu-thu-22/mm_dataset

Viewer • Updated 4 days ago • 12.1k • 24
upvoted a paper 8 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16, 2025 • 60
upvoted a paper 9 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21, 2025 • 47
upvoted a paper 10 months ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29, 2025 • 46