MedPAO: A Protocol-Driven Agent for Structuring Medical Reports

📅 2025-10-06
🤖 AI Summary
Large language models (LLMs) are prone to factual hallucination and adhere poorly to domain-specific clinical rules when structuring medical reports. To address these limitations, the authors propose MedPAO, a clinical-protocol-guided agent framework built on a Plan-Act-Observe reasoning loop. MedPAO grounds the LLM’s inference process in standardized clinical protocols (e.g., ABCDEF for CXR interpretation) and integrates specialized tools so that structured generation is interpretable, traceable, and protocol-compliant. The framework supports both concept classification and structured output generation through task decomposition, protocol-driven decision-making, and tool-augmented execution. In the reported experiments, MedPAO reaches an F1-score of 0.96 on concept classification and an average clinical expert rating of 4.52/5 on structured outputs, outperforming LLM-only baselines. By embedding clinical protocols in the LLM’s closed-loop reasoning, MedPAO aims to improve accuracy, trustworthiness, and professional compliance in medical report structuring.

📝 Abstract
The deployment of Large Language Models (LLMs) for structuring clinical data is critically hindered by their tendency to hallucinate facts and their inability to follow domain-specific rules. To address this, we introduce MedPAO, a novel agentic framework that ensures accuracy and verifiable reasoning by grounding its operation in established clinical protocols such as the ABCDEF protocol for CXR analysis. MedPAO decomposes the report structuring task into a transparent process managed by a Plan-Act-Observe (PAO) loop and specialized tools. This protocol-driven method provides a verifiable alternative to opaque, monolithic models. The efficacy of our approach is demonstrated through rigorous evaluation: MedPAO achieves an F1-score of 0.96 on the critical sub-task of concept categorization. Notably, expert radiologists and clinicians rated the final structured outputs with an average score of 4.52 out of 5, indicating a level of reliability that surpasses baseline approaches relying solely on LLM-based foundation models. The code is available at: https://github.com/MiRL-IITM/medpao-agent
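
As an illustration of what grounding the output in a protocol can mean in practice, the sketch below accepts a proposed category only if it names a section of a closed protocol. It is a minimal example, not the authors' implementation: the `Section` enum uses one common expansion of the ABCDEF mnemonic (the paper's exact category definitions may differ), and `parse_label` and `structure` are hypothetical helper names.

```python
from enum import Enum


class Section(Enum):
    # Assumption: one common expansion of the ABCDEF chest X-ray mnemonic.
    A = "Airway"
    B = "Bones"
    C = "Cardiac"
    D = "Diaphragm"
    E = "Effusions"
    F = "Fields"


def parse_label(raw: str) -> Section:
    """Map a proposed label to a protocol section, or reject it."""
    text = raw.strip()
    for section in Section:
        if text.upper() == section.name or text.lower() == section.value.lower():
            return section
    # Anything outside the protocol is an error, never a silent new category.
    raise ValueError(f"label {raw!r} is not an ABCDEF section")


def structure(labelled):
    """Group (concept, proposed_label) pairs under validated protocol sections."""
    report = {section: [] for section in Section}
    for concept, raw in labelled:
        report[parse_label(raw)].append(concept)
    return report


if __name__ == "__main__":
    out = structure([("cardiomegaly", "C"), ("pleural effusion", "effusions")])
    for section, concepts in out.items():
        print(section.name, section.value, concepts)
```

Because every section is present in the result even when it is empty, a reader can tell that a section was checked and had no findings, rather than having been skipped.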

Problem

Research questions and friction points this paper is trying to address.

Addressing LLM hallucinations in clinical data structuring tasks
Ensuring adherence to domain-specific medical protocols like ABCDEF
Providing verifiable reasoning for medical report structuring processes

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Plan-Act-Observe loop for transparent reasoning
Relies on clinical protocols to guide structured reporting
Integrates specialized tools for accurate medical categorization
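
The Plan-Act-Observe loop named above can be sketched roughly as follows. This is a toy version under stated assumptions: the planner is a fixed queue of tool names rather than an LLM, and `AgentState`, `run_pao`, `extract_concepts`, `categorize`, and the `KEYWORDS` table are all hypothetical names invented for illustration, not taken from the MedPAO code.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class AgentState:
    report: str
    pending: List[str]  # tool names still to run, in plan order
    observations: Dict[str, object] = field(default_factory=dict)
    trace: List[Tuple[str, str]] = field(default_factory=list)


def plan(state: AgentState) -> Optional[str]:
    """Plan: pick the next tool (a real agent would ask an LLM here)."""
    return state.pending[0] if state.pending else None


def extract_concepts(state: AgentState) -> List[str]:
    # Toy tool: treat each sentence of the report as one concept.
    return [s.strip() for s in state.report.split(".") if s.strip()]


# Assumption: a tiny keyword table standing in for a real categorization tool.
KEYWORDS = {"trachea": "A", "heart": "C", "effusion": "E"}


def categorize(state: AgentState) -> Dict[str, str]:
    concepts = state.observations["extract_concepts"]
    return {
        c: next((lab for kw, lab in KEYWORDS.items() if kw in c.lower()), "unassigned")
        for c in concepts
    }


TOOLS: Dict[str, Callable[[AgentState], object]] = {
    "extract_concepts": extract_concepts,
    "categorize": categorize,
}


def run_pao(report: str, tools, order: List[str], max_steps: int = 10) -> AgentState:
    state = AgentState(report=report, pending=list(order))
    for _ in range(max_steps):
        step = plan(state)                        # Plan
        if step is None:
            break
        result = tools[step](state)               # Act
        state.observations[step] = result         # Observe
        state.trace.append((step, repr(result)))  # keep an auditable record
        state.pending.pop(0)
    return state


if __name__ == "__main__":
    final = run_pao(
        "Heart size is enlarged. Small left pleural effusion.",
        TOOLS,
        ["extract_concepts", "categorize"],
    )
    for name, result in final.trace:
        print(name, "->", result)
```

The `trace` list is the part that corresponds to verifiable reasoning: each tool call and its result is recorded in order, so the path from report text to category can be inspected afterwards.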
Shrish Shrinath Vaidya
Department of Data Science and AI, IIT Madras, India
Gowthamaan Palani
Department of Engineering Design, IIT Madras, India
Sidharth Ramesh
Department of Data Science and AI, IIT Madras, India
Velmurugan Balasubramanian
LoveForm Health Technologies, India
Minmini Selvam
Department of Radiology and Imaging Sciences, Sri Ramachandra Institute of Higher Education and Research, India
Gokulraja Srinivasaraja
Department of Neuro and Interventional Radiology, Sri Ramachandra Institute of Higher Education and Research, India
Ganapathy Krishnamurthi
Associate Professor at IIT-Madras
medical imaging