Papers
arxiv:2503.12345

General Table Question Answering via Answer-Formula Joint Generation

Published on Mar 16, 2025
Authors:
,
,
,

Abstract

A novel table question answering framework uses spreadsheet formulas as executable representations, achieving state-of-the-art performance on multiple table datasets.

AI-generated summary

Advanced table question answering (TableQA) methods prompt large language models (LLMs) to generate answer text, SQL query, Python code, or custom operation, which impressively improve the complex reasoning problems in the TableQA task. However, these methods lack the versatility to cope with specific question types or table structures. In contrast, the Spreadsheet Formula, the widely used and well-defined operation language for tabular data, has not been thoroughly explored to solve TableQA. In this paper, we first attempt to use the Formula as the executable representation for solving complex reasoning on tables with different structures. Specifically, we construct FromulaQA, a large Formula-annotated TableQA dataset from existing datasets. In addition, we propose TabAF, a general table answering framework to solve multiple types of tasks over multiple types of tables simultaneously, which decodes answers and Formulas with a single LLM backbone. Extensive experiments demonstrate the versatility and generalization of TabAF. Under the same model size, TabAF achieves new state-of-the-art performance on the WikiTableQuestion, HiTab, and TabFact.

Community

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2503.12345 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2503.12345 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.