AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation Paper • 2509.16952 • Published Sep 21, 2025
Externalizing Research Synthesis and Validation in AI Scientists through a Research Harness Paper • 2606.18874 • Published 11 days ago • 6
Externalizing Research Synthesis and Validation in AI Scientists through a Research Harness Paper • 2606.18874 • Published 11 days ago • 6