Boosting Pathology Foundation Models via Few-shot Prompt-tuning for Rare Cancer Subtyping

📅 2025-08-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Rare cancer subtyping faces critical challenges including scarcity of expert pathologists, limited labeled data, and poor model interpretability—particularly in pediatric oncology, where rare subtypes constitute over 70% of cases. While existing vision-language foundation models exhibit strong zero-shot performance on common cancers, their clinical utility for rare subtypes remains limited; mainstream multiple-instance learning approaches rely solely on visual features, lacking cross-modal semantic alignment and fine-grained tumor localization capability. To address these gaps, we propose PathPT—a novel framework that pioneers the use of vision-language models to generate slice-level weak supervision signals via zero-shot inference. PathPT integrates spatially aware feature aggregation with task-adaptive few-shot prompt tuning to achieve precise tumor region localization and cross-modal pathological semantic alignment. Evaluated on eight rare cancer datasets, PathPT achieves significant improvements in average subtyping accuracy, while simultaneously enhancing localization precision and interpretability—demonstrating robust generalizability across both adult and pediatric rare tumors.

Technology Category

Application Category

📝 Abstract
Rare cancers comprise 20-25% of all malignancies but face major diagnostic challenges due to limited expert availability-especially in pediatric oncology, where they represent over 70% of cases. While pathology vision-language (VL) foundation models show promising zero-shot capabilities for common cancer subtyping, their clinical performance for rare cancers remains limited. Existing multi-instance learning (MIL) methods rely only on visual features, overlooking cross-modal knowledge and compromising interpretability critical for rare cancer diagnosis. To address this limitation, we propose PathPT, a novel framework that fully exploits the potential of vision-language pathology foundation models through spatially-aware visual aggregation and task-specific prompt tuning. Unlike conventional MIL, PathPT converts WSI-level supervision into fine-grained tile-level guidance by leveraging the zero-shot capabilities of VL models, thereby preserving localization on cancerous regions and enabling cross-modal reasoning through prompts aligned with histopathological semantics. We benchmark PathPT on eight rare cancer datasets(four adult and four pediatric) spanning 56 subtypes and 2,910 WSIs, as well as three common cancer datasets, evaluating four state-of-the-art VL models and four MIL frameworks under three few-shot settings. Results show that PathPT consistently delivers superior performance, achieving substantial gains in subtyping accuracy and cancerous region grounding ability. This work advances AI-assisted diagnosis for rare cancers, offering a scalable solution for improving subtyping accuracy in settings with limited access to specialized expertise.
Problem

Research questions and friction points this paper is trying to address.

Improving rare cancer subtyping accuracy with limited expert annotations
Enhancing vision-language models' performance for rare cancer diagnostics
Addressing interpretability limitations in multi-instance learning methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Few-shot prompt-tuning for pathology models
Spatially-aware visual aggregation technique
Cross-modal reasoning with histopathological prompts
🔎 Similar Papers
No similar papers found.
D
Dexuan He
School of Artificial Intelligence, Shanghai Jiao Tong University
Xiao Zhou
Xiao Zhou
M.Phil student in HKUST
Autonomous DrivingDRL
W
Wenbin Guan
Department of Pathology, Xinhua Hospital, Affiliated to Shanghai Jiao Tong University School of Medicine
L
Liyuan Zhang
School of Artificial Intelligence, Shanghai Jiao Tong University
Xiaoman Zhang
Xiaoman Zhang
Harvard University
AI for MedicineMedical Image Analysis
S
Sinuo Xu
School of Artificial Intelligence, Shanghai Jiao Tong University
G
Ge Wang
Department of Oral Pathology, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine
Lifeng Wang
Lifeng Wang
Institute of Advanced Science Facilities, Shenzhen
High-order harmonic generationattosecond physics
Xiaojun Yuan
Xiaojun Yuan
University of Electronic Science and Technology of China
statistical signal processingmachine learningwireless communications
X
Xin Sun
Clinical Research and Innovation Unit, Xinhua Hospital, Affiliated to Shanghai Jiao Tong University School of Medicine
Yanfeng Wang
Yanfeng Wang
Shanghai Jiao Tong University
K
Kun Sun
Department of Pediatric Cardiology, Xinhua Hospital, Affiliated to Shanghai Jiao Tong University School of Medicine
Ya Zhang
Ya Zhang
Shanghai Jiao Tong University
Machine learningComputer visionMedical Imaging
Weidi Xie
Weidi Xie
Shanghai Jiao Tong University | VGG, University of Oxford
Computer VisionAI for HealthcareAI for Science