Shazam: Unifying Multiple Foundation Models for Advanced Computational Pathology

📅 2025-03-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In computational pathology, existing foundation models (FMs) suffer from poor generalizability and unstable performance when deployed across diverse downstream tasks. To address this, we propose Shazam—a lightweight, multi-model collaborative inference framework for multi-task adaptation. Its core innovation is the first-of-its-kind multi-teacher collaborative distillation mechanism, which employs dynamic weight scheduling and multi-source feature fusion to prevent dominance by any single model, thereby enabling task-driven feature refinement and complementary knowledge integration. Crucially, Shazam operates without fine-tuning individual FMs, significantly reducing computational overhead. Evaluated on two benchmark histopathological image patch classification datasets, Shazam consistently outperforms both individual models and state-of-the-art ensemble methods, achieving superior accuracy while reducing inference cost by over 30%. This establishes a robust, scalable, and unified inference paradigm for clinical pathology AI.

Technology Category

Application Category

📝 Abstract
Foundation Models (FMs) in computational pathology (CPath) have significantly advanced the extraction of meaningful features from histopathology image datasets, achieving strong performance across various clinical tasks. Despite their impressive performance, these models often exhibit variability when applied to different tasks, prompting the need for a unified framework capable of consistently excelling across various applications. In this work, we propose Shazam, a novel framework designed to efficiently combine multiple CPath models. Unlike previous approaches that train a fixed-parameter FM, Shazam dynamically extracts and refines information from diverse FMs for each specific task. To ensure that each FM contributes effectively without dominance, a novel distillation strategy is applied, guiding the student model with features from all teacher models, which enhances its generalization ability. Experimental results on two pathology patch classification datasets demonstrate that Shazam outperforms existing CPath models and other fusion methods. Its lightweight, flexible design makes it a promising solution for improving CPath analysis in real-world settings. Code will be available at https://github.com/Tuner12/Shazam.
Problem

Research questions and friction points this paper is trying to address.

Unifies multiple foundation models for computational pathology tasks.
Addresses variability in model performance across different clinical applications.
Enhances generalization through a novel distillation strategy.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic combination of multiple foundation models
Novel distillation strategy for model integration
Lightweight and flexible design for real-world applications
Wenhui Lei
Wenhui Lei
University of Pennsylvania
AI4HealthArtifical Intelligence
A
Anqi Li
Washington University in St. Louis
Y
Yusheng Tan
Washington University in St. Louis
H
Hanyu Chen
The First Hospital of China Medical University
X
Xiaofan Zhang
Shanghai Jiaotong University, Shanghai Artificial Intelligence Laboratory, Washington University in St. Louis