Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

📅 2024-05-05
🏛️ arXiv.org
📈 Citations: 43
Influential: 5
📄 PDF
🤖 AI Summary
Large language model (LLM)-driven virtual physicians often lack robust diagnostic capabilities in real-world clinical settings due to insufficient domain-specific reasoning and limited exposure to diverse, high-fidelity medical scenarios. Method: We propose MedAgent-Zero—a knowledge-driven, closed-loop evolutionary multi-agent virtual hospital simulation framework. It integrates disease dynamics modeling, a structured medical knowledge base, and collaborative multi-agent orchestration to simulate end-to-end clinical workflows involving patient, nurse, and physician agents. The physician agent autonomously refines its clinical decision-making policy through reinforcement learning on ~10,000 synthetic cases with success/failure feedback, then transfers the learned capabilities to real-world evaluation. Contribution/Results: On the respiratory-disease subset of MedQA, MedAgent-Zero achieves 93.06% accuracy—setting a new state-of-the-art—and demonstrates significantly improved generalization over existing LLM-based diagnostic approaches.

Technology Category

Application Category

📝 Abstract
In this paper, we introduce a simulacrum of hospital called Agent Hospital that simulates the entire process of treating illness. All patients, nurses, and doctors are autonomous agents powered by large language models (LLMs). Our central goal is to enable a doctor agent to learn how to treat illness within the simulacrum. To do so, we propose a method called MedAgent-Zero. As the simulacrum can simulate disease onset and progression based on knowledge bases and LLMs, doctor agents can keep accumulating experience from both successful and unsuccessful cases. Simulation experiments show that the treatment performance of doctor agents consistently improves on various tasks. More interestingly, the knowledge the doctor agents have acquired in Agent Hospital is applicable to real-world medicare benchmarks. After treating around ten thousand patients (real-world doctors may take over two years), the evolved doctor agent achieves a state-of-the-art accuracy of 93.06% on a subset of the MedQA dataset that covers major respiratory diseases. This work paves the way for advancing the applications of LLM-powered agent techniques in medical scenarios.
Problem

Research questions and friction points this paper is trying to address.

Virtual Doctors
Diagnostic Ability
Simulation Environment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Agent Hospital
MedAgent-Zero
AI-controlled healthcare agents
🔎 Similar Papers
No similar papers found.
J
Junkai Li
Institute for AI Research (AIR), Tsinghua University, China; Department of Computer Science and Technology, Tsinghua University, China
S
Siyu Wang
Institute for AI Research (AIR), Tsinghua University, China
M
Meng Zhang
Institute for AI Research (AIR), Tsinghua University, China
Weitao Li
Weitao Li
清华大学计算机系
RAG RLAgentsMedicine
Yunghwei Lai
Yunghwei Lai
Institute for AI Industry Research, Tsinghua University
LLM Agent | AI Healthcare
X
Xinhui Kang
Institute for AI Research (AIR), Tsinghua University, China; Department of Computer Science and Technology, Tsinghua University, China
Weizhi Ma
Weizhi Ma
Tsinghua University
LLM and AgentsRecommendationAI for Healthcare
Y
Yang Liu
Institute for AI Research (AIR), Tsinghua University, China; Department of Computer Science and Technology, Tsinghua University, China