AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery Paper • 2605.23204 • Published 5 days ago • 25 • 4
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs Paper • 2510.16062 • Published Oct 17, 2025 • 1 • 2
MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks Paper • 2505.16459 • Published May 22, 2025 • 45 • 4