Equipping Retrieval-Augmented Large Language Models with Document Structure Awareness

📅 2025-10-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing RAG methods treat documents as flat, unstructured text chunks, neglecting inherent structural information—thereby limiting multi-document reasoning and knowledge integration. To address this, we propose RDR2, the first end-to-end document-structure-aware retrieval-augmented generation framework. RDR2 employs an LLM-driven trainable router that dynamically navigates an explicitly modeled document structure tree; integrates automated action construction, structure-aware paragraph selection, and hierarchical evidence assembly to jointly optimize content relevance and hierarchical coherence; and refines routing decisions via reinforcement learning–inspired policy optimization. Evaluated on five challenging multi-document reasoning benchmarks, RDR2 achieves significant improvements in answer accuracy and information synthesis quality. Our results demonstrate that explicit structural modeling delivers a critical performance gain for RAG systems, advancing robust, interpretable, and contextually grounded reasoning over structured document collections.

Technology Category

Application Category

📝 Abstract
While large language models (LLMs) demonstrate impressive capabilities, their reliance on parametric knowledge often leads to factual inaccuracies. Retrieval-Augmented Generation (RAG) mitigates this by leveraging external documents, yet existing approaches treat retrieved passages as isolated chunks, ignoring valuable structure that is crucial for document organization. Motivated by this gap, we propose Retrieve-DocumentRoute-Read (RDR2), a novel framework that explicitly incorporates structural information throughout the RAG process. RDR2 employs an LLM-based router to dynamically navigate document structure trees, jointly evaluating content relevance and hierarchical relationships to assemble optimal evidence. Our key innovation lies in formulating document routing as a trainable task, with automatic action curation and structure-aware passage selection inspired by human reading strategies. Through comprehensive evaluation on five challenging datasets, RDR2 achieves state-of-the-art performance, demonstrating that explicit structural awareness significantly enhances RAG systems' ability to acquire and utilize knowledge, particularly in complex scenarios requiring multi-document synthesis.
Problem

Research questions and friction points this paper is trying to address.

Incorporating document structure awareness into retrieval-augmented generation systems
Overcoming limitations of treating retrieved passages as isolated chunks
Enhancing knowledge acquisition in complex multi-document scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Incorporates document structure trees in retrieval process
Uses trainable routing with automatic action curation
Selects passages based on hierarchical relationships
🔎 Similar Papers
No similar papers found.
L
Lingnan Xu
School of Computer Science, Beijing Institute of Technology
C
Chong Feng
Southeast Academy of Information Technology, Beijing Institute of Technology
K
Kaiyuan Zhang
School of Computer Science, Beijing Institute of Technology
L
Liu Zhengyong
Ant Group
Wenqiang Xu
Wenqiang Xu
Shanghai Jiao Tong University
Computer visionRobotics
Fanqing Meng
Fanqing Meng
Phd Student, Shanghai Jiaotong University
Multimodel LearningTransfer LearningLarge Language Model