Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation

📅 2024-07-26
🏛️ arXiv.org
📈 Citations: 11
Influential: 2
📄 PDF
🤖 AI Summary
Current foundational models in computational pathology lack systematic evaluation of generalization capability across diverse clinical tasks and tissue types. Method: We introduce the first comprehensive pathology generalization benchmark—covering 39 clinical tasks across six categories and 34 tissue types—to systematically identify generalization bottlenecks in state-of-the-art models. Leveraging this benchmark, we propose a unified knowledge distillation framework that jointly employs multi-expert guidance and self-distillation to align local and global representations, enabling the pretraining of a general-purpose foundational model, GPFM. GPFM is trained via large-scale self-supervised representation learning on over one million H&E-stained whole-slide images. Contribution/Results: GPFM achieves top-1 performance on 29 of the 39 tasks, with a mean rank of 1.36—significantly outperforming UNI (2.96). It establishes a new feature backbone for computational pathology, setting a benchmark for cross-task and cross-tissue generalization.

Technology Category

Application Category

📝 Abstract
Foundation models pretrained on large-scale datasets are revolutionizing the field of computational pathology (CPath). The generalization ability of foundation models is crucial for the success in various downstream clinical tasks. However, current foundation models have only been evaluated on a limited type and number of tasks, leaving their generalization ability and overall performance unclear. To address this gap, we established a most comprehensive benchmark to evaluate the performance of off-the-shelf foundation models across six distinct clinical task types, encompassing a total of 39 specific tasks. Our findings reveal that existing foundation models excel at certain task types but struggle to effectively handle the full breadth of clinical tasks. To improve the generalization of pathology foundation models, we propose a unified knowledge distillation framework consisting of both expert and self knowledge distillation, where the former allows the model to learn from the knowledge of multiple expert models, while the latter leverages self-distillation to enable image representation learning via local-global alignment. Based on this framework, a Generalizable Pathology Foundation Model (GPFM) is pretrained on a large-scale dataset consisting of 190 million images from around 86,000 public H&E whole slides across 34 major tissue types. Evaluated on the established benchmark, GPFM achieves an impressive average rank of 1.36, with 29 tasks ranked 1st, while the the second-best model, UNI, attains an average rank of 2.96, with only 4 tasks ranked 1st. The superior generalization of GPFM demonstrates its exceptional modeling capabilities across a wide range of clinical tasks, positioning it as a new cornerstone for feature representation in CPath.
Problem

Research questions and friction points this paper is trying to address.

Evaluating generalization of pathology foundation models across diverse clinical tasks
Proposing unified knowledge distillation to enhance model adaptability
Developing a comprehensive benchmark with 72 specific pathology tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified knowledge distillation framework
Expert and self-knowledge distillation
Generalizable Pathology Foundation Model
🔎 Similar Papers
No similar papers found.
J
Jiabo Ma
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Z
Zhengrui Guo
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Fengtao Zhou
Fengtao Zhou
Hong Kong University of Science and Technology
Multimodal LearningComputational Pathology
Y
Yihui Wang
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Yingxue Xu
Yingxue Xu
The Hong Kong University of Science and Technology
Multimodal LearningSurvival AnalysisComputational Pathology
Y
Yu Cai
Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Z
Zhengjie Zhu
Information Hub, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China.
C
Cheng Jin
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Y
Yi Lin
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
Xinrui Jiang
Xinrui Jiang
The Hong Kong University of Science and Technology (HKUST)
Computer visionMedical image analysis
A
Anjia Han
Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.
Li Liang
Li Liang
The University of Western Australia
3D Point Cloud Processing3D Semantic Scene Completion3D Semantic Scene Generation
R
R. Chan
Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Hong Kong SAR, China.
J
Jiguang Wang
Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
K
Kwang-Ting Cheng
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
H
Hao Chen
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.; Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.; Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong SAR, China.