arxiv:2601.11044
Keyu Li (SII)
weizhihao1
AI & ML interests
Large Language Model Agent, Multi-Agent System
Recent Activity
updated
a dataset
3 days ago
GAIR/AgencyBench
commented on
a paper
6 days ago
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
published
a dataset
7 days ago
GAIR/AgencyBench