# Benchmark Overview

*Date: 2025-06-30*

Here we describe our dataset of statutory audit scenarios, evaluation metrics, and baseline results…