AgentSearchBench: A Benchmark for AI Agent Search in the Wild Paper • 2604.22436 • Published 17 days ago • 14
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem Paper • 2602.14367 • Published Feb 16 • 17