Foundation Models -- A Panacea for Artificial Intelligence in Pathology?

📅 2025-02-28
📈 Citations: 0
✨ Influential: 0
📄 PDF
🤖 AI Summary
Whether foundation models (FMs) serve as a “universal solution” for AI in digital pathology—specifically for prostate cancer diagnosis and Gleason grading—remains unclear. Method: We constructed a standardized, multi-center evaluation framework using >100,000 needle biopsy images from 15 institutions across 11 countries, comparing self-supervised FMs against task-specific convolutional neural networks (TS-CNNs) trained end-to-end. Contribution/Results: Contrary to prevailing assumptions, FMs showed no statistically significant advantage over TS-CNNs under data-sufficient conditions. TS-CNNs matched or surpassed FMs in diagnostic accuracy, Gleason grade concordance, and critical misclassification rates, while demonstrating superior cross-device generalizability and consuming only ~2.9% of the FM’s inference energy. These findings empirically validate that lightweight, task-optimized, robust architectures—not generic FMs—are more clinically deployable for routine pathology AI applications.

Technology Category

Application Category

📝 Abstract
The role of artificial intelligence (AI) in pathology has evolved from aiding diagnostics to uncovering predictive morphological patterns in whole slide images (WSIs). Recently, foundation models (FMs) leveraging self-supervised pre-training have been widely advocated as a universal solution for diverse downstream tasks. However, open questions remain about their clinical applicability and generalization advantages over end-to-end learning using task-specific (TS) models. Here, we focused on AI with clinical-grade performance for prostate cancer diagnosis and Gleason grading. We present the largest validation of AI for this task, using over 100,000 core needle biopsies from 7,342 patients across 15 sites in 11 countries. We compared two FMs with a fully end-to-end TS model in a multiple instance learning framework. Our findings challenge assumptions that FMs universally outperform TS models. While FMs demonstrated utility in data-scarce scenarios, their performance converged with - and was in some cases surpassed by - TS models when sufficient labeled training data were available. Notably, extensive task-specific training markedly reduced clinically significant misgrading, misdiagnosis of challenging morphologies, and variability across different WSI scanners. Additionally, FMs used up to 35 times more energy than the TS model, raising concerns about their sustainability. Our results underscore that while FMs offer clear advantages for rapid prototyping and research, their role as a universal solution for clinically applicable medical AI remains uncertain. For high-stakes clinical applications, rigorous validation and consideration of task-specific training remain critically important. We advocate for integrating the strengths of FMs and end-to-end learning to achieve robust and resource-efficient AI pathology solutions fit for clinical use.
Problem

Research questions and friction points this paper is trying to address.

Evaluating clinical applicability of foundation models in pathology
Comparing performance of foundation models vs. task-specific models
Assessing energy efficiency and sustainability of foundation models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Foundation models for diverse pathology tasks
Comparison with task-specific end-to-end models
Energy efficiency and clinical validation importance
🔎 Similar Papers
No similar papers found.
Nita Mulliqi
Nita Mulliqi
PhD Student at Karolinska Institute
Artificial intelligenceBiomedical Image ProcessingDigital PathologyProstate Cancer
A
Anders Blilie
Department of Pathology, Stavanger University Hospital, Stavanger, Norway; Faculty of Health Sciences, University of Stavanger, Stavanger, Norway
Xiaoyi Ji
Xiaoyi Ji
Karolinska Institutet
K
Kelvin Szolnoky
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Henrik Olsson
Henrik Olsson
Complexity Science Hub, Vienna
Complexity ScienceCognitive ScienceMachine Learning
Sol Erika Boman
Sol Erika Boman
PhD, Karolinska Institute
Digital pathologyCancer epidemiology
M
Matteo Titus
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
G
Geraldine Martinez Gonzalez
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
J
Julia Anna Mielcarz
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
M
Masi Valkonen
Institute of Biomedicine, University of Turku, Turku, Finland
E
Einar Gudlaugsson
Department of Pathology, Stavanger University Hospital, Stavanger, Norway
S
Svein R. Kjosavik
The General Practice and Care Coordination Research Group, Stavanger University Hospital, Norway; Department of Global Public Health and Primary Care, Faculty of Medicine, University of Bergen, Norway
J
JosĂŠ Asenjo
Department of Pathology, Synlab, Madrid, Spain
M
Marcello Gambacorta
Department of Pathology, Synlab, Brescia, Italy
P
Paolo Libretti
Department of Pathology, Synlab, Brescia, Italy
M
Marcin Braun
Department of Pathology, Chair of Oncology, Medical University of Lodz, Lodz, Poland
R
Radzislaw Kordek
Department of Pathology, Chair of Oncology, Medical University of Lodz, Lodz, Poland
R
Roman Łowicki
1st Department of Urology, Medical University of Lodz, Lodz, Poland
K
Kristina Hotakainen
Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland; Laboratory Services, Mehiläinen Oy, Helsinki, Finland
P
Päivi Väre
Department of Pathology, Mehiläinen Länsi-Pohja Hospital, Kemi, Finland
B
Bodil Ginnerup Pedersen
Department of Radiology, Aarhus University Hospital, Aarhus, Denmark
K
Karina Dalsgaard Sørensen
Department of Clinical Medicine, Aarhus University, Aarhus, Denmark; Department of Molecular Medicine, Aarhus University Hospital, Aarhus, Denmark
B
Benedicte Parm Ulhøi
Department of Pathology, Aarhus University Hospital, Aarhus, Denmark
Pekka Ruusuvuori
Pekka Ruusuvuori
University of Turku
bioimage informaticsmachine learning & AIimage analysiscomputational pathology
B
Brett Delahunt
Malaghan Institute of Medical Research, Wellington, New Zealand; Department of Oncology and Pathology, Karolinska Institutet, Stockholm, Sweden
H
Hemamali Samaratunga
Aquesta Uropathology and University of Queensland, QLD, Brisbane, Australia
T
Toyonori Tsuzuki
Department of Surgical Pathology, School of Medicine, Aichi Medical University, Nagoya, Japan
E
Emilius A.M. Janssen
Department of Pathology, Stavanger University Hospital, Stavanger, Norway; Department of Chemistry, Bioscience and Environmental Engineering, University of Stavanger, Stavanger, Norway; Institute for Biomedicine and Glycomics, Griffith University, Queensland, Australia
L
Lars Egevad
Department of Oncology and Pathology, Karolinska Institutet, Stockholm, Sweden
Martin Eklund
Martin Eklund
Professor of Epidemiology, Karolinska Institutet
Kimmo Kartasalo
Kimmo Kartasalo
Assistant Professor, Karolinska institutet
Artificial intelligenceComputational pathologyImage analysisMachine learning