tu's picture

tu

yihaotu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

upvoted a paper 21 days ago

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

commentedon a paper 3 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

View all activity

Organizations

upvoted a paper 12 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

Paper • 2606.16771 • Published 14 days ago • 13

upvoted a paper 21 days ago

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Paper • 2606.03980 • Published 27 days ago • 13

upvoted a paper 3 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 56