Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval

📅 2025-11-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address factual inaccuracy of large language models (LLMs) in healthcare prediction, this paper tackles two core challenges: determining optimal retrieval timing and achieving joint optimization between retriever and generator. We propose a Hierarchical Agent Retrieval (HAR) framework featuring a dual-agent architecture—comprising a retrieval agent and a generation agent—unified under a shared Markov Decision Process (MDP) to model their collaborative decision-making and enable experience-augmented, dynamic knowledge invocation. Technically, HAR integrates Retrieval-Augmented Generation (RAG), GraphRAG, hierarchical agent systems, and reinforcement learning. Evaluated on three mainstream healthcare benchmark tasks, HAR significantly outperforms state-of-the-art methods, demonstrating substantial improvements in prediction accuracy, knowledge adaptability, and inter-module coordination efficiency.

Technology Category

Application Category

📝 Abstract
Accurate healthcare prediction is critical for improving patient outcomes and reducing operational costs. Bolstered by growing reasoning capabilities, large language models (LLMs) offer a promising path to enhance healthcare predictions by drawing on their rich parametric knowledge. However, LLMs are prone to factual inaccuracies due to limitations in the reliability and coverage of their embedded knowledge. While retrieval-augmented generation (RAG) frameworks, such as GraphRAG and its variants, have been proposed to mitigate these issues by incorporating external knowledge, they face two key challenges in the healthcare scenario: (1) identifying the clinical necessity to activate the retrieval mechanism, and (2) achieving synergy between the retriever and the generator to craft contextually appropriate retrievals. To address these challenges, we propose GHAR, a underline{g}enerative underline{h}ierarchical underline{a}gentic underline{R}AG framework that simultaneously resolves when to retrieve and how to optimize the collaboration between submodules in healthcare. Specifically, for the first challenge, we design a dual-agent architecture comprising Agent-Top and Agent-Low. Agent-Top acts as the primary physician, iteratively deciding whether to rely on parametric knowledge or to initiate retrieval, while Agent-Low acts as the consulting service, summarising all task-relevant knowledge once retrieval was triggered. To tackle the second challenge, we innovatively unify the optimization of both agents within a formal Markov Decision Process, designing diverse rewards to align their shared goal of accurate prediction while preserving their distinct roles. Extensive experiments on three benchmark datasets across three popular tasks demonstrate our superiority over state-of-the-art baselines, highlighting the potential of hierarchical agentic RAG in advancing healthcare systems.
Problem

Research questions and friction points this paper is trying to address.

Addressing factual inaccuracies in LLMs for healthcare predictions
Resolving when to activate retrieval mechanisms in clinical scenarios
Optimizing synergy between retrieval and generation modules in healthcare
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical dual-agent architecture for retrieval decisions
Markov Decision Process optimization for agent collaboration
Generative framework integrating parametric and external knowledge
🔎 Similar Papers
Chuang Zhao
Chuang Zhao
PhD Candidate, The Hong Kong University of Science and Technology
AI for HealthcareRecommendation SystemTransfer Learning
H
Hui Tang
Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong, SAR, China
H
Hongke Zhao
College of Management and Economics, Laboratory of Computation and Analytics of Complex Management Systems (CACMS), Tianjin University, Tianjin 30072, China
Xiaofang Zhou
Xiaofang Zhou
Hong Kong University of Science and Technology
databasesbig datadata scienceAI
Xiaomeng Li
Xiaomeng Li
Assistant Professor, The Hong Kong University of Science and Technology
Medical Image AnalysisAI in HealthcareDeep Learning