StepChain GraphRAG: Reasoning Over Knowledge Graphs for Multi-Hop Question Answering

📅 2025-10-03
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address poor coordination between reasoning and external knowledge retrieval, as well as limited interpretability, in multi-hop question answering, this paper proposes StepChain GraphRAG, a framework that combines question decomposition with breadth-first search (BFS)-guided reasoning over a knowledge graph built on demand. A global index is built over the corpus once; during inference, only the retrieved passages are parsed into a knowledge graph, and BFS traversal assembles explicit multi-hop evidence chains, so the reasoning path and its supporting knowledge evolve together. Core techniques include retrieval-augmented generation, dynamic knowledge subgraph construction, stepwise question decomposition, and BFS-driven iterative reasoning. Evaluated on MuSiQue, 2WikiMultiHopQA, and HotpotQA, the approach achieves state-of-the-art results: average exact match (EM) improves by 2.57% and F1 by 2.13% over the previous state-of-the-art method, with the largest gain on HotpotQA (+4.70% EM, +3.44% F1). The explicit evidence chains also make the reasoning easier to inspect.

📝 Abstract
Recent progress in retrieval-augmented generation (RAG) has led to more accurate and interpretable multi-hop question answering (QA). Yet, challenges persist in integrating iterative reasoning steps with external knowledge retrieval. To address this, we introduce StepChain GraphRAG, a framework that unites question decomposition with a Breadth-First Search (BFS) Reasoning Flow for enhanced multi-hop QA. Our approach first builds a global index over the corpus; at inference time, only retrieved passages are parsed on-the-fly into a knowledge graph, and the complex query is split into sub-questions. For each sub-question, a BFS-based traversal dynamically expands along relevant edges, assembling explicit evidence chains without overwhelming the language model with superfluous context. Experiments on MuSiQue, 2WikiMultiHopQA, and HotpotQA show that StepChain GraphRAG achieves state-of-the-art Exact Match and F1 scores. StepChain GraphRAG lifts average EM by 2.57% and F1 by 2.13% over the SOTA method, achieving the largest gain on HotpotQA (+4.70% EM, +3.44% F1). StepChain GraphRAG also fosters enhanced explainability by preserving the chain-of-thought across intermediate retrieval steps. We conclude by discussing how future work can mitigate the computational overhead and address potential hallucinations from large language models to refine efficiency and reliability in multi-hop QA.
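The BFS-based traversal described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy triples, the `build_graph` and `bfs_evidence_chain` helpers, and the use of a fixed set of "relevant" relation names per sub-question are all assumptions made for illustration. The sketch only shows how a breadth-first expansion restricted to relevant edges yields an explicit evidence chain.

```python
from collections import deque

# Toy triples standing in for a graph parsed from retrieved passages.
# Entities and relations are invented for illustration only.
TRIPLES = [
    ("Film A", "directed_by", "Director B"),
    ("Director B", "born_in", "City C"),
    ("City C", "located_in", "Country D"),
    ("Film A", "released_in", "1999"),
]


def build_graph(triples):
    """Build an adjacency list: head entity -> list of (relation, tail)."""
    graph = {}
    for head, relation, tail in triples:
        graph.setdefault(head, []).append((relation, tail))
    return graph


def bfs_evidence_chain(graph, start, goal_relation, relevant, max_hops=3):
    """Expand breadth-first from `start`, following only relevant relations.

    Returns the shortest chain of triples whose last edge has
    `goal_relation`, or None if no such chain exists within `max_hops`.
    """
    queue = deque([(start, [])])
    visited = {start}
    while queue:
        entity, chain = queue.popleft()
        if len(chain) >= max_hops:
            continue  # hop budget used up on this path
        for relation, tail in graph.get(entity, []):
            if relation not in relevant:
                continue  # skip edges unrelated to the sub-question
            new_chain = chain + [(entity, relation, tail)]
            if relation == goal_relation:
                return new_chain
            if tail not in visited:
                visited.add(tail)
                queue.append((tail, new_chain))
    return None


graph = build_graph(TRIPLES)
chain = bfs_evidence_chain(
    graph, "Film A", "born_in", relevant={"directed_by", "born_in"}
)
for head, relation, tail in chain:
    print(f"{head} --{relation}--> {tail}")
```

For the hypothetical question "Where was the director of Film A born?", the returned chain holds two triples (Film A to Director B, then Director B to City C), while the unrelated `released_in` edge is never expanded. Filtering edges before enqueueing is one simple way to keep superfluous context out of the chain; how relevance is actually scored in StepChain GraphRAG is not specified in the abstract.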
Problem

Research questions and friction points this paper is trying to address.

Integrating iterative reasoning with knowledge retrieval
Building dynamic knowledge graphs from retrieved passages
Splitting complex queries into sub-questions for traversal
Innovation

Methods, ideas, or system contributions that make the work stand out.

Builds a global index over the corpus
Parses retrieved passages into a knowledge graph
Uses BFS-based traversal for dynamic evidence chains
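
The first two contributions above (a global index built once, then on-the-fly parsing of only the retrieved passages) can be sketched as follows. Everything here is a hypothetical stand-in: the keyword inverted index replaces whatever retriever the paper uses, `ANNOTATED_TRIPLES` replaces a real triple extractor, and the sub-questions are written by hand rather than produced by a decomposition step.

```python
import re
from collections import defaultdict

# Toy corpus; passage ids and text are invented for illustration.
CORPUS = {
    "p1": "Film A was directed by Director B.",
    "p2": "Director B was born in City C.",
    "p3": "City Z hosts an annual music festival.",
}

# Placeholder for a triple extractor: hand-written triples per passage.
ANNOTATED_TRIPLES = {
    "p1": [("Film A", "directed_by", "Director B")],
    "p2": [("Director B", "born_in", "City C")],
    "p3": [("City Z", "hosts", "music festival")],
}


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_index(corpus):
    """Global inverted index, built once: token -> set of passage ids."""
    index = defaultdict(set)
    for pid, text in corpus.items():
        for token in tokenize(text):
            index[token].add(pid)
    return index


def retrieve(index, query, k=1):
    """Return the ids of the k passages sharing the most tokens with the query."""
    scores = defaultdict(int)
    for token in tokenize(query):
        for pid in index.get(token, ()):
            scores[pid] += 1
    # Sort by descending overlap, breaking ties by passage id.
    ranked = sorted(scores, key=lambda pid: (-scores[pid], pid))
    return ranked[:k]


def build_query_graph(index, sub_questions, k=1):
    """Parse only the passages retrieved for the sub-questions into triples."""
    triples, seen = [], set()
    for sub_question in sub_questions:
        for pid in retrieve(index, sub_question, k):
            if pid not in seen:
                seen.add(pid)
                triples.extend(ANNOTATED_TRIPLES[pid])
    return triples


index = build_index(CORPUS)
sub_questions = ["Who directed Film A?", "Where was Director B born?"]
print(build_query_graph(index, sub_questions))
```

In this toy run only `p1` and `p2` are parsed; `p3` stays in the index but never enters the query-time graph, which is the point of building the graph from retrieved passages rather than from the whole corpus.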
Tengjun Ni
University of Technology Sydney, Australia
Xin Yuan
Data61, CSIRO, Australia
Shenghong Li
Data61, CSIRO, Australia
Kai Wu
University of Technology Sydney, Australia
Ren Ping Liu
University of Technology Sydney
Wireless Networking, Network Security, Blockchain
Wei Ni
FIEEE, AAIA Fellow, Senior Principal Scientist & Conjoint Professor, CSIRO/UNSW
6G security and privacy, connected and trusted intelligence, applied AI/ML
Wenjie Zhang
University of New South Wales, Australia