AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow

📅 2024-09-27
🏛️ arXiv.org
📈 Citations: 8
Influential: 1
🤖 AI Summary
This study addresses the lack of high-fidelity, trustworthy AI-based simulated patients for medical education and clinical decision-making. It proposes AIPatient, a framework that constructs a clinical knowledge graph from MIMIC-III electronic health records (EHRs) and runs a reasoning-augmented retrieval-augmented generation (Reasoning RAG) multi-agent workflow. The workflow comprises six specialized LLM agents for retrieval, knowledge graph query generation, abstraction, checking, rewriting, and summarization, which together support knowledge-grounded reasoning and natural clinician-patient interaction. The key contributions are a clinical knowledge graph built from real EHR data (1,495 patients) and a structured multi-agent reasoning workflow. Reported results include 94.15% accuracy on EHR-based question answering, a knowledge base F1 of 0.89, high readability (median Flesch Reading Ease 77.23), and no statistically significant variation in the robustness and stability tests (ANOVA, p > 0.1), which the authors present as evidence of potential for medical education, model evaluation, and system integration.

📝 Abstract
Simulated patient systems play a crucial role in modern medical education and research, providing safe, integrative learning environments and enabling clinical decision-making simulations. Large Language Models (LLMs) could advance simulated patient systems by replicating medical conditions and patient-doctor interactions with high fidelity and low cost. However, ensuring the effectiveness and trustworthiness of these systems remains a challenge, as they require a large, diverse, and precise patient knowledgebase, along with robust and stable knowledge diffusion to users. Here, we developed AIPatient, an advanced simulated patient system with the AIPatient Knowledge Graph (AIPatient KG) as the input and the Reasoning Retrieval-Augmented Generation (Reasoning RAG) agentic workflow as the generation backbone. AIPatient KG samples data from Electronic Health Records (EHRs) in the Medical Information Mart for Intensive Care (MIMIC)-III database, producing a clinically diverse and relevant cohort of 1,495 patients with high knowledgebase validity (F1 0.89). Reasoning RAG leverages six LLM-powered agents spanning tasks including retrieval, KG query generation, abstraction, checking, rewriting, and summarization. This agentic framework reaches an overall accuracy of 94.15% in EHR-based medical Question Answering (QA), outperforming benchmarks that use either no agent or only partial agent integration. Our system also presents high readability (median Flesch Reading Ease 77.23; median Flesch-Kincaid Grade 5.6), robustness (ANOVA F-value 0.6126, p > 0.1), and stability (ANOVA F-value 0.782, p > 0.1). The promising performance of the AIPatient system highlights its potential to support a wide range of applications, including medical education, model evaluation, and system integration.
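The two readability scores quoted in the abstract follow standard published formulas, so a rough sketch of how such scores can be computed may help in interpreting them. The Python below is illustrative only and is not the authors' evaluation code: the syllable counter is a crude vowel-group heuristic, the function names are made up for this example, and the sample sentence is invented.

```python
import re

def count_syllables(word: str) -> int:
    """Crude heuristic (an assumption): count vowel groups, drop a silent final 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith(("le", "ee")) and count > 1:
        count -= 1
    return max(count, 1)

def text_stats(text: str) -> tuple:
    """Return (sentence count, word count, syllable count) for non-empty text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(sentences), len(words), syllables

def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: higher scores mean easier text."""
    s, w, syl = text_stats(text)
    return 206.835 - 1.015 * (w / s) - 84.6 * (syl / w)

def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid Grade: approximate US school grade level."""
    s, w, syl = text_stats(text)
    return 0.39 * (w / s) + 11.8 * (syl / w) - 15.59

# Invented patient-style reply, used only to exercise the functions.
reply = "I have had a bad cough for three days. It hurts when I breathe."
print(round(flesch_reading_ease(reply), 2), round(flesch_kincaid_grade(reply), 2))
```

Short sentences with mostly one-syllable words push Reading Ease up and the grade level down, which is the direction the reported medians (77.23 and 5.6) point in. Production evaluations typically rely on a dictionary-based syllable counter instead of this heuristic.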
Problem

Research questions and friction points this paper is trying to address.

Enhancing medical training with AI-powered simulated patients
Improving accuracy in EHR-based medical question answering
Ensuring high fidelity and usability for medical education
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses large language model-based AI agents
Incorporates a retrieval-augmented generation (RAG) framework
Leverages a knowledge graph built from real patient data
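To make the combination of agents, retrieval, and a knowledge graph more concrete, here is a minimal sketch of how six stages with the roles named in the abstract could be chained. Everything in it is an assumption made for illustration: the toy graph, the relation names, the stage order, and the rule-based stubs standing in for LLM calls. It does not reproduce the paper's prompts, schema, or implementation.

```python
from typing import List, Optional, Tuple

# Toy knowledge graph (invented data): (patient_id, relation) -> values.
TOY_KG = {
    ("p1", "HAS_SYMPTOM"): ["chest pain", "shortness of breath"],
    ("p1", "HAS_ALLERGY"): ["penicillin"],
}

# Hypothetical keyword-to-relation map standing in for an LLM retrieval step.
KEYWORDS = {"symptom": "HAS_SYMPTOM", "feel": "HAS_SYMPTOM", "allerg": "HAS_ALLERGY"}

# Hypothetical first-person templates standing in for an LLM rewrite step.
TEMPLATES = {
    "HAS_SYMPTOM": "I have been having {}.",
    "HAS_ALLERGY": "I am allergic to {}.",
}

def abstraction_agent(question: str) -> str:
    """Reduce the question to a normalized form."""
    return question.lower().strip(" ?!.")

def retrieval_agent(abstract_q: str) -> Optional[str]:
    """Pick the graph relation that looks relevant to the question."""
    for keyword, relation in KEYWORDS.items():
        if keyword in abstract_q:
            return relation
    return None

def kg_query_agent(patient_id: str, relation: str) -> List[str]:
    """Run the 'query' against the toy graph."""
    return TOY_KG.get((patient_id, relation), [])

def checker_agent(facts: List[str]) -> bool:
    """Accept the retrieval only if it returned something."""
    return len(facts) > 0

def rewrite_agent(relation: str, facts: List[str]) -> str:
    """Turn raw facts into a patient-voice sentence."""
    return TEMPLATES[relation].format(" and ".join(facts))

def summarization_agent(history: List[Tuple[str, str]]) -> str:
    """Condense the conversation so far."""
    return " | ".join(f"Q: {q} A: {a}" for q, a in history)

def answer(patient_id: str, question: str, history: List[Tuple[str, str]]) -> str:
    """Chain the stages for one turn and record it in the history."""
    relation = retrieval_agent(abstraction_agent(question))
    facts = kg_query_agent(patient_id, relation) if relation else []
    if checker_agent(facts):
        reply = rewrite_agent(relation, facts)
    else:
        reply = "I'm not sure about that."
    history.append((question, reply))
    return reply

history: List[Tuple[str, str]] = []
print(answer("p1", "Do you have any allergies?", history))
print(answer("p1", "What symptoms do you have?", history))
print(summarization_agent(history))
```

The point of the sketch is the separation of concerns: answers are grounded in graph facts, a checking step gates what reaches the user, and rewriting happens only after the facts are fixed, so the patient persona cannot introduce details absent from the record.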
Huizi Yu
University of Michigan, Ann Arbor, MI, United States
Jiayan Zhou
Stanford University
Genomics, Environmental Health Science, Knowledge-based Modeling, Statistical Modeling
Lingyao Li
Assistant Professor, School of Information, University of South Florida
Generative AI, Social Computing, Urban Computing, Health Informatics
Shan Chen
Artificial Intelligence in Medicine Program, Mass General Brigham, Boston, MA, United States; Harvard Medical School, Boston, MA, United States
Jack Gallifant
AIM @ Harvard-MGB
AI, Alignment, Healthcare, Interpretability, Robustness
Anye Shi
Cornell University, Ithaca, NY, United States
Xiang Li
Mass General Brigham, Boston, MA, United States
Wenyue Hua
Senior Researcher, Microsoft Research
LLM-based agent, large language model, computational linguistics, recommender system
Mingyu Jin
Ph.D. Student in Computer Science, Rutgers University, New Brunswick
Natural Language Processing, Interpretable Machine Learning
Guang Chen
Harvard Medical School, Boston, MA, United States
Yang Zhou
Cornell University, Ithaca, NY, United States
Zhao Li
Mass General Brigham, Boston, MA, United States
Trisha P Gupte
Artificial Intelligence in Medicine Program, Mass General Brigham, Boston, MA, United States
Ming-Li Chen
Artificial Intelligence in Medicine Program, Mass General Brigham, Boston, MA, United States
Zahra Azizi
Artificial Intelligence in Medicine Program, Mass General Brigham, Boston, MA, United States; Harvard Medical School, Boston, MA, United States
Yongfeng Zhang
Mass General Brigham, Boston, MA, United States
Themistocles L. Assimes
Artificial Intelligence in Medicine Program, Mass General Brigham, Boston, MA, United States
Xin Ma
Mass General Brigham, Boston, MA, United States
Danielle S. Bitterman
Harvard Medical School
Oncology, Natural Language Processing, Artificial Intelligence
Lin Lu
PhD student, Nankai University
Conformal inference, Multiple testing
Lizhou Fan
Vice-Chancellor Assistant Professor, The Chinese University of Hong Kong
Medical AI, Health Informatics, AI Agents, AI for Science, Psychiatry