Merging Clinical Knowledge into Large Language Models for Medical Research and Applications: A Survey

📅 2025-02-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current medical AI development lacks systematic construction guidelines, particularly suffering from paradigmatic gaps and inconsistent evaluation criteria for integrating clinical knowledge into large language models (LLMs). Method: This study presents the first comprehensive review of academic and industrial practices in building medical AI systems, proposing an integrated framework encompassing clinical databases (e.g., MIMIC, UMLS), biomedical knowledge graphs, instruction tuning, retrieval-augmented generation (RAG), and multi-granularity evaluation. We establish a six-dimensional comparative taxonomy and distill insights from representative systems—including DoctorGPT and Pangu-Drug. Contribution/Results: The work identifies key knowledge-enhancement pathways and clinical deployment bottlenecks, highlighting interpretability, data privacy, and clinical alignment as critical challenges. It provides both theoretical foundations and practical guidance for the standardized development and trustworthy deployment of medical LLMs.

Technology Category

Application Category

📝 Abstract
Clinical knowledge is the collection of information learned from studies on the causes, prognosis, diagnosis, and treatment of diseases. This type of knowledge can improve curing performances, and promote physical health. With the emergence of large language models (LLMs), medical artificial intelligence (medical AI), which aims to apply academic medical AI systems to real-world medical scenarios, has entered a new age of development, resulting in excellent works such as DoctorGPT and Pangu-Drug from academic and industrial researches. However, the field lacks a comprehensive compendium and comparison of building medical AI systems from academia and industry. Therefore, this survey focuses on the building paradigms of medical AI systems including the use of clinical databases, datasets, training pipelines, integrating medical knowledge graphs, system applications, and evaluation systems. We hope that this survey can help relevant practical researchers understand the current performance of academic models in various fields of healthcare, as well as the potential problems and future directions for implementing these scientific achievements.
Problem

Research questions and friction points this paper is trying to address.

Lack of comprehensive review on medical AI systems development.
Need for integrating clinical knowledge into large language models.
Challenges in applying academic medical AI to real-world scenarios.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates clinical databases into AI training pipelines
Utilizes medical knowledge graphs for enhanced accuracy
Compares academic and industrial medical AI systems
🔎 Similar Papers
No similar papers found.
Q
Qiyuan Li
Haijiang Liu
Haijiang Liu
Wuhan University of Science and Technology
Cross-cultural NLPKnowledge GraphLarge language modelMultilingual NLP
C
Caicai Guo
D
Deyu Chen
M
Meng Wang
F
Feng Gao
J
Jinguang Gu