Zhenghai Xue

ZhenghaiXue · AI_Defender

AI & ML interests

Reinforcement Learning

authored 2 papers 8 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

AgentStudio: A Toolkit for Building General Virtual Agents

Paper • 2403.17918 • Published Mar 26, 2024

authored a paper 12 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 20