AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 578 items • Updated 3 days ago • 79
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 11 days ago • 34
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published 6 days ago • 47
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents Paper • 2601.12346 • Published 9 days ago • 47
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 7 days ago • 34
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper • 2601.11077 • Published 11 days ago • 63
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 11 days ago • 26
A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation Paper • 2601.09274 • Published 13 days ago • 83