ReasonTabQA: A Comprehensive Benchmark for Table Question Answering from Real World Industrial Scenarios

📅 2026-01-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing table-based question answering benchmarks struggle to address the complex reasoning challenges posed by real-world industrial scenarios, such as multi-table relationships, nested headers, and large-scale data. To bridge this gap, this work introduces ReasonTabQA, the first high-complexity bilingual table QA benchmark tailored for authentic industrial applications, encompassing 30 domains and 1,932 tables with annotated answers and explicit reasoning chains, supporting both chain-of-thought and non-chain-of-thought paradigms. Furthermore, the authors propose TabCodeRL, a method that integrates table structure awareness with a verifiable reasoning reward mechanism, leveraging reinforcement learning to guide large language models in generating logically sound and verifiable reasoning paths. Experiments demonstrate that TabCodeRL significantly improves performance on open-source models, yet a notable gap remains compared to human-level accuracy, underscoring the inherent difficulty of industrial-scale table question answering.

Technology Category

Application Category

📝 Abstract
Recent advancements in Large Language Models (LLMs) have significantly catalyzed table-based question answering (TableQA). However, existing TableQA benchmarks often overlook the intricacies of industrial scenarios, which are characterized by multi-table structures, nested headers, and massive scales. These environments demand robust table reasoning through deep structured inference, presenting a significant challenge that remains inadequately addressed by current methodologies. To bridge this gap, we present ReasonTabQA, a large-scale bilingual benchmark encompassing 1,932 tables across 30 industry domains such as energy and automotive. ReasonTabQA provides high-quality annotations for both final answers and explicit reasoning chains, supporting both thinking and no-thinking paradigms. Furthermore, we introduce TabCodeRL, a reinforcement learning method that leverages table-aware verifiable rewards to guide the generation of logical reasoning paths. Extensive experiments on ReasonTabQA and 4 TableQA datasets demonstrate that while TabCodeRL yields substantial performance gains on open-source LLMs, the persistent performance gap on ReasonTabQA underscores the inherent complexity of real-world industrial TableQA.
Problem

Research questions and friction points this paper is trying to address.

Table Question Answering
Industrial Scenarios
Multi-table Structures
Nested Headers
Structured Reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Table Question Answering
Industrial Benchmark
Reasoning Chain
Reinforcement Learning
Structured Inference
🔎 Similar Papers
No similar papers found.
C
Changzai Pan
Institute of Artificial Intelligence (TeleAI), China Telecom
J
Jie Zhang
Institute of Artificial Intelligence (TeleAI), China Telecom
K
Kaiwen Wei
Chongqing University
C
Chenshuo Pan
Institute of Artificial Intelligence (TeleAI), China Telecom
Y
Yu Zhao
Institute of Artificial Intelligence (TeleAI), China Telecom
J
Jingwang Huang
Institute of Artificial Intelligence (TeleAI), China Telecom
J
Jian Yang
Beihang University
Z
Zhenhe Wu
Institute of Artificial Intelligence (TeleAI), China Telecom
Haoyang Zeng
Haoyang Zeng
Xaira Theurapeutics
Machine LearningProtein DesignPeptide VaccineGene Regulation
X
Xiaoyan Gu
Institute of Artificial Intelligence (TeleAI), China Telecom
Weichao Sun
Weichao Sun
Harbin Institute of Technology
MechatronicsMotion Control
Y
Yanbo Zhai
Institute of Artificial Intelligence (TeleAI), China Telecom
Y
Yujie Mao
Institute of Artificial Intelligence (TeleAI), China Telecom
Z
Zhuoru Jiang
Institute of Artificial Intelligence (TeleAI), China Telecom
J
Jiang Zhong
Chongqing University
S
Shuangyong Song
Institute of Artificial Intelligence (TeleAI), China Telecom
Yongxiang Li
Yongxiang Li
Professor, RMIT University
Electronic Materials and Devices
Z
Zhongjiang He
Institute of Artificial Intelligence (TeleAI), China Telecom