Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with an Iterative Approach

📅 2024-07-18
🏛️ arXiv.org
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Multi-hop question answering faces challenges including difficulty in acquiring all necessary evidence in a single retrieval step, information overload from iterative retrieval, and lack of explicit process tracking. This paper proposes ReSP, an iterative retrieval-augmented generation framework. ReSP introduces a novel dual-objective summarizer that concurrently compresses information relevant to both the global question and dynamically generated sub-questions. It incorporates a retrieval trajectory memory mechanism to explicitly model and record retrieval paths, thereby suppressing redundant planning. Additionally, it leverages LLM-driven sub-question decomposition and verification for lightweight, efficient iterative information integration. Evaluated on HotpotQA and 2WikiMultihopQA, ReSP achieves significant improvements over state-of-the-art methods, demonstrates strong robustness to long contexts, improves inference efficiency by 23%, and attains a summary compression rate of 68%.

Technology Category

Application Category

📝 Abstract
Multi-hop question answering is a challenging task with distinct industrial relevance, and Retrieval-Augmented Generation (RAG) methods based on large language models (LLMs) have become a popular approach to tackle this task. Owing to the potential inability to retrieve all necessary information in a single iteration, a series of iterative RAG methods has been recently developed, showing significant performance improvements. However, existing methods still face two critical challenges: context overload resulting from multiple rounds of retrieval, and over-planning and repetitive planning due to the lack of a recorded retrieval trajectory. In this paper, we propose a novel iterative RAG method called ReSP, equipped with a dual-function summarizer. This summarizer compresses information from retrieved documents, targeting both the overarching question and the current sub-question concurrently. Experimental results on the multi-hop question-answering datasets HotpotQA and 2WikiMultihopQA demonstrate that our method significantly outperforms the state-of-the-art, and exhibits excellent robustness concerning context length.
Problem

Research questions and friction points this paper is trying to address.

Multi-hop Question Answering
Information Overload
Clue Tracking
Innovation

Methods, ideas, or system contributions that make the work stand out.

ReSP Method
Information Extraction
Multi-hop Question Answering
🔎 Similar Papers
No similar papers found.