Finally Outshining the Random Baseline: A Simple and Effective Solution for Active Learning in 3D Biomedical Imaging

📅 2026-01-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited performance gains of existing active learning methods over enhanced random sampling baselines in 3D biomedical image segmentation, primarily attributed to class imbalance and insufficient query diversity. To overcome these challenges, the authors propose ClaSP PE, a novel query strategy that integrates class-stratified sampling, logarithmic-scale power-law noise injection, and a decay scheduling mechanism to promote diversity during early acquisition rounds while focusing on high-value samples in later stages. Implemented within the nnActive framework without requiring task-specific hyperparameter tuning, ClaSP PE consistently outperforms random baselines across 24 experimental settings on four datasets, achieving simultaneous improvements in segmentation accuracy and annotation efficiency. Furthermore, the method demonstrates robust generalization on four unseen datasets, confirming its stability and broad applicability.

Technology Category

Application Category

📝 Abstract
Active learning (AL) has the potential to drastically reduce annotation costs in 3D biomedical image segmentation, where expert labeling of volumetric data is both time-consuming and expensive. Yet, existing AL methods are unable to consistently outperform improved random sampling baselines adapted to 3D data, leaving the field without a reliable solution. We introduce Class-stratified Scheduled Power Predictive Entropy (ClaSP PE), a simple and effective query strategy that addresses two key limitations of standard uncertainty-based AL methods: class imbalance and redundancy in early selections. ClaSP PE combines class-stratified querying to ensure coverage of underrepresented structures and log-scale power noising with a decaying schedule to enforce query diversity in early-stage AL and encourage exploitation later. In our evaluation on 24 experimental settings using four 3D biomedical datasets within the comprehensive nnActive benchmark, ClaSP PE is the only method that generally outperforms improved random baselines in terms of both segmentation quality with statistically significant gains, whilst remaining annotation efficient. Furthermore, we explicitly simulate the real-world application by testing our method on four previously unseen datasets without manual adaptation, where all experiment parameters are set according to predefined guidelines. The results confirm that ClaSP PE robustly generalizes to novel tasks without requiring dataset-specific tuning. Within the nnActive framework, we present compelling evidence that an AL method can consistently outperform random baselines adapted to 3D segmentation, in terms of both performance and annotation efficiency in a realistic, close-to-production scenario. Our open-source implementation and clear deployment guidelines make it readily applicable in practice. Code is at https://github.com/MIC-DKFZ/nnActive.
Problem

Research questions and friction points this paper is trying to address.

active learning
3D biomedical imaging
annotation efficiency
random baseline
image segmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

active learning
3D biomedical imaging
class imbalance
query diversity
predictive entropy
🔎 Similar Papers
No similar papers found.
C
Carsten T. Luth
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, University of Heidelberg, Germany
J
Jeremias Traub
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
K
Kim-Celine Kahl
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, University of Heidelberg, Germany
T
Till J. Bungert
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, University of Heidelberg, Germany
Lukas Klein
Lukas Klein
EPFL, USZ
Machine LearningBiotechComputer Vision
L
Lars Kramer
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, University of Heidelberg, Germany
P
Paul F. Jager
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; German Cancer Research Center (DKFZ) Heidelberg, Interactive Machine Learning Group, Germany
K
K. Maier-Hein
German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, University of Heidelberg, Germany; Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Germany; National Center for Tumor Diseases (NCT) Heidelberg, Germany
Fabian Isensee
Fabian Isensee
HIP Applied Computer Vision Lab, Division of Medical Image Computing, German Cancer Research Center
Computer VisionDeep LearningSegmentationMedical Image Computing