tu's picture

tu

yihaotu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

upvoted a paper 18 days ago

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

commentedon a paper 3 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

View all activity

Organizations

upvoted a paper 9 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

Paper • 2606.16771 • Published 11 days ago • 13

upvoted a paper 18 days ago

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Paper • 2606.03980 • Published 24 days ago • 13

commented a paper 3 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 56 •

upvoted a paper 3 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 56

updated a dataset 5 months ago

yihaotu/mind2web-utg-crawl

Updated Jan 31 • 323

published a dataset 5 months ago

yihaotu/mind2web-utg-crawl

Updated Jan 31 • 323

published a model 12 months ago

yihaotu/notebook-rl

Updated Jul 11, 2025