Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains

📅 2025-12-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing medical image segmentation models are constrained by single-segmentation protocols or reliance on manual prompts, limiting their ability to automatically support concurrent segmentation across diverse semantic protocols—such as tissue, anatomy, and pathology—in unseen domains. Method: We propose a novel multi-protocol consistent segmentation paradigm, built upon a vision–semantics-aligned multi-head decoder architecture. It incorporates protocol-aware feature disentanglement and cross-image semantic consistency regularization, trained via a zero-shot domain generalization strategy to enable fully automatic, multi-label, semantically consistent segmentation without human intervention. Contribution/Results: Evaluated on seven hold-out datasets, our method achieves state-of-the-art performance in multi-protocol segmentation plausibility, cross-image semantic consistency, and zero-shot generalization—significantly alleviating foundational model dependencies on single-protocol constraints or strong manual prompting.

Technology Category

Application Category

📝 Abstract
A single biomedical image can be meaningfully segmented in multiple ways, depending on the desired application. For instance, a brain MRI can be segmented according to tissue types, vascular territories, broad anatomical regions, fine-grained anatomy, or pathology, etc. Existing automatic segmentation models typically either (1) support only a single protocol, the one they were trained on, or (2) require labor-intensive manual prompting to specify the desired segmentation. We introduce Pancakes, a framework that, given a new image from a previously unseen domain, automatically generates multi-label segmentation maps for multiple plausible protocols, while maintaining semantic consistency across related images. Pancakes introduces a new problem formulation that is not currently attainable by existing foundation models. In a series of experiments on seven held-out datasets, we demonstrate that our model can significantly outperform existing foundation models in producing several plausible whole-image segmentations, that are semantically coherent across images.
Problem

Research questions and friction points this paper is trying to address.

Automatically generates multi-label segmentations for multiple protocols
Maintains semantic consistency across related biomedical images
Outperforms existing models in producing plausible whole-image segmentations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatically generates multi-protocol segmentation maps
Ensures semantic consistency across related images
Introduces new problem formulation beyond foundation models
🔎 Similar Papers
No similar papers found.
Marianne Rakic
Marianne Rakic
PhD Student, Massachusetts Institute of Technology
Computer vision and Healthcare
S
Siyu Gai
MIT CSAIL, MGH
E
Etienne Chollet
MIT CSAIL, MGH
J
John V. Guttag
MIT CSAIL
A
Adrian V. Dalca
MIT CSAIL, HMS, MGH