Towards Foundation Models and Few-Shot Parameter-Efficient Fine-Tuning for Volumetric Organ Segmentation

📅 2023-03-29
🏛️ ISIC/Care-AI/MedAGI/DeCaF@MICCAI
📈 Citations: 12
Influential: 1
📄 PDF
🤖 AI Summary
To address the challenges of scarce annotated data, high computational cost, and poor generalization in 3D medical image segmentation, this paper pioneers the adaptation of foundation model paradigms to volumetric segmentation. We propose a few-shot, parameter-efficient unified fine-tuning framework integrating a Vision Transformer backbone, LoRA adapters, learnable prompt tuning, and cross-modal feature alignment. The method achieves rapid adaptation to novel organs using only 1–5 annotated samples per target anatomy. Evaluated on multi-center CT datasets including BTCV, it attains state-of-the-art performance while reducing trainable parameters by 98% and accelerating inference by 40%. Our core contribution is the first foundation-model-based few-shot adaptation framework specifically designed for volumetric segmentation—enabling synergistic optimization of minimal annotation requirements, ultra-low parameter updates, and strong generalization across anatomies and domains.
Problem

Research questions and friction points this paper is trying to address.

Addressing resource-intensive full fine-tuning for medical image segmentation
Enabling few-shot adaptation with parameter-efficient methods
Improving dense prediction tasks via spatial black-box adapters
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parameter-Efficient Fine-Tuning for medical images
Black-box Adapters for dense prediction tasks
Few-Shot Efficient Fine-Tuning with data efficiency