🤖 AI Summary
The absence of large-scale, publicly available multi-annotator dermoscopic lesion segmentation (SLS) datasets hinders progress in medical image segmentation with multiple expert annotations. To address this, we introduce the largest open multi-annotator SLS dataset to date, comprising 14,967 dermoscopic images and 17,684 high-quality segmentation masks; notably, 2,394 images are independently annotated by 2–5 dermatology experts. For the first time, we systematically collect fine-grained metadata—including annotator expertise levels and tool preferences—alongside structured metadata acquisition, inter-annotator consistency evaluation, and consensus mask generation. The dataset is derived from ISIC Archive images and provides standardized train/val/test splits, consensus labels, and uncertainty quantification. It enables novel research directions including annotator preference modeling, robust segmentation under annotation variability, multi-annotator learning, and human-AI collaborative diagnosis.
📝 Abstract
Multi-annotator medical image segmentation is an important research problem, but requires annotated datasets that are expensive to collect. Dermoscopic skin lesion imaging allows human experts and AI systems to observe morphological structures otherwise not discernable from regular clinical photographs. However, currently there are no large-scale publicly available multi-annotator skin lesion segmentation (SLS) datasets with annotator-labels for dermoscopic skin lesion imaging. We introduce ISIC MultiAnnot++, a large public multi-annotator skin lesion segmentation dataset for images from the ISIC Archive. The final dataset contains 17,684 segmentation masks spanning 14,967 dermoscopic images, where 2,394 dermoscopic images have 2-5 segmentations per image, making it the largest publicly available SLS dataset. Further, metadata about the segmentation, including the annotators' skill level and segmentation tool, is included, enabling research on topics such as annotator-specific preference modeling for segmentation and annotator metadata analysis. We provide an analysis on the characteristics of this dataset, curated data partitions, and consensus segmentation masks.