🤖 AI Summary
This paper addresses the lack of a unified analytical framework and insufficient integration of ethical and technical considerations in current medical large language models (LLMs). To bridge this gap, we propose the first end-to-end analytical framework covering data curation, instruction tuning, alignment optimization, and safety enhancement—revealing a paradigm shift from discriminative to generative AI and from model-centric to data-centric healthcare AI. Grounded in clinical workflow requirements, we systematically survey domain-specific datasets, open benchmarks, and evaluation methodologies. Furthermore, we articulate a co-designed technical-governance pathway that ensures fairness, accountability, and transparency. The study culminates in an interdisciplinary survey report and a publicly available GitHub repository, offering both theoretical foundations and actionable guidance for the responsible development and deployment of medical LLMs.
📝 Abstract
The utilization of large language models (LLMs) in the Healthcare domain has generated both excitement and concern due to their ability to effectively respond to freetext queries with certain professional knowledge. This survey outlines the capabilities of the currently developed LLMs for Healthcare and explicates their development process, with the aim of providing an overview of the development roadmap from traditional Pretrained Language Models (PLMs) to LLMs. Specifically, we first explore the potential of LLMs to enhance the efficiency and effectiveness of various Healthcare applications highlighting both the strengths and limitations. Secondly, we conduct a comparison between the previous PLMs and the latest LLMs, as well as comparing various LLMs with each other. Then we summarize related Healthcare training data, training methods, optimization strategies, and usage. Finally, the unique concerns associated with deploying LLMs in Healthcare settings are investigated, particularly regarding fairness, accountability, transparency and ethics. Our survey provide a comprehensive investigation from perspectives of both computer science and Healthcare specialty. Besides the discussion about Healthcare concerns, we supports the computer science community by compiling a collection of open source resources, such as accessible datasets, the latest methodologies, code implementations, and evaluation benchmarks in the Github. Summarily, we contend that a significant paradigm shift is underway, transitioning from PLMs to LLMs. This shift encompasses a move from discriminative AI approaches to generative AI approaches, as well as a shift from model-centered methodologies to data-centered methodologies. Also, we determine that the biggest obstacle of using LLMs in Healthcare are fairness, accountability, transparency and ethics.