AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 18 days ago • 102
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published May 13 • 274
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 355
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists Paper • 2604.28158 • Published Apr 30 • 49
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published Apr 9 • 295
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published Apr 7 • 122
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published Apr 3 • 19
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 641 items • Updated 11 days ago • 98
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published Apr 2 • 102