Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

📅 2025-06-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current computational pathology faces three critical bottlenecks: (1) deep learning models lack instruction-following capability and contextual multimodal fusion; (2) multimodal large language models (MLLMs) suffer from scarcity of annotated pathology data, inadequate support for multi-image reasoning, and absence of autonomous diagnostic reasoning; and (3) existing frameworks cannot handle gigapixel whole-slide images (WSIs) with hierarchical, interpretable inference. To address these, we propose PathChat+, introducing the first million-scale pathology instruction-tuning dataset and SlideSeek—a novel multi-agent system enabling hierarchical, autonomous WSI diagnosis. Our method integrates vision-language alignment, instruction tuning, and hierarchical reasoning to support natural-language interaction and generation of interpretable, multimodal diagnostic reports. Evaluated on benchmarks including DDxBench, PathChat+ significantly outperforms state-of-the-art models, achieving differential diagnosis accuracy at clinically viable levels.

Technology Category

Application Category

📝 Abstract
Pathology is experiencing rapid digital transformation driven by whole-slide imaging and artificial intelligence (AI). While deep learning-based computational pathology has achieved notable success, traditional models primarily focus on image analysis without integrating natural language instruction or rich, text-based context. Current multimodal large language models (MLLMs) in computational pathology face limitations, including insufficient training data, inadequate support and evaluation for multi-image understanding, and a lack of autonomous, diagnostic reasoning capabilities. To address these limitations, we introduce PathChat+, a new MLLM specifically designed for human pathology, trained on over 1 million diverse, pathology-specific instruction samples and nearly 5.5 million question answer turns. Extensive evaluations across diverse pathology benchmarks demonstrated that PathChat+ substantially outperforms the prior PathChat copilot, as well as both state-of-the-art (SOTA) general-purpose and other pathology-specific models. Furthermore, we present SlideSeek, a reasoning-enabled multi-agent AI system leveraging PathChat+ to autonomously evaluate gigapixel whole-slide images (WSIs) through iterative, hierarchical diagnostic reasoning, reaching high accuracy on DDxBench, a challenging open-ended differential diagnosis benchmark, while also capable of generating visually grounded, humanly-interpretable summary reports.
Problem

Research questions and friction points this paper is trying to address.

Lack of natural language integration in pathology AI models
Insufficient multi-image understanding in current MLLMs
Absence of autonomous diagnostic reasoning in computational pathology
Innovation

Methods, ideas, or system contributions that make the work stand out.

PathChat+ MLLM trained on pathology-specific data
SlideSeek system for autonomous WSI evaluation
Iterative hierarchical reasoning for accurate diagnosis
🔎 Similar Papers
No similar papers found.
Chengkuan Chen
Chengkuan Chen
Unknown affiliation
Machine learning in biomedical application
L
Luca L. Weishaupt
Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA; Health Sciences and Technology, Harvard-MIT, Cambridge, MA
D
Drew F. K. Williamson
Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA; Health Sciences and Technology, Harvard-MIT, Cambridge, MA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA
Richard J. Chen
Richard J. Chen
Brigham and Women's Hospital, Harvard Medical School
Representation LearningComputer VisionComputational Pathology
Tong Ding
Tong Ding
PhD student in Computer Science, Harvard University
Representation LearningComputer VisionMultimodal LearningMachine Learning for Health
B
Bowen Chen
Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA; Health Sciences and Technology, Harvard-MIT, Cambridge, MA
Anurag Vaidya
Anurag Vaidya
MIT
Multimodal frontier modelsagentsclinical deployment
Long Phi Le
Long Phi Le
Massachusetts General Hospital
Molecular DiagnosticsNext-Generation SequencingTarget EnrichmentBioinformaticsLaboratory Informatics
Guillaume Jaume
Guillaume Jaume
Harvard Medical School, Brigham and Women's Hospital
Computational Pathology - Computer Vision
Ming Y. Lu
Ming Y. Lu
MIT EECS, Harvard Medical School
Computational PathologyComputer VisionNatural Language Processing
Faisal Mahmood
Faisal Mahmood
Associate Professor, Harvard University