Mingzhe Li
Mubuky
ยท
AI & ML interests
RL & Agent
Recent Activity
upvoted
a
paper
12 days ago
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking