A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation Paper • 2605.17278 • Published 3 days ago • 2
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective Paper • 2505.23833 • Published May 28, 2025 • 1