Zhiheng Xi's picture

Zhiheng Xi

WooooDyy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

upvoted a paper 27 days ago

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

authored a paper 3 months ago

AI Can Learn Scientific Taste

View all activity

Organizations

commented 3 papers 8 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21 •

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21 •

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85 •

commented a paper over 1 year ago

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 21 •