OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 34
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite Paper • 2510.21652 • Published Oct 24, 2025 • 4
PreScience: A Benchmark for Forecasting Scientific Contributions Paper • 2602.20459 • Published 16 days ago • 3