Flexible Sampling for Long-tailed Skin Lesion Classification

📅 2022-04-07
🏛️ International Conference on Medical Image Computing and Computer-Assisted Intervention
📈 Citations: 9
Influential: 0
📄 PDF
🤖 AI Summary
Skin lesion classification inherently exhibits a long-tailed distribution due to inter-subject variability and the rarity of certain diseases. Conventional long-tailed learning methods apply uniform class-level resampling, ignoring intra-class difficulty heterogeneity—thereby compromising discriminative capability. To address this, we propose a curriculum learning-driven dynamic sampling framework. It introduces a learnability-aware soft sampling mechanism anchored on class prototypes for difficulty estimation, coupled with a dynamic curriculum scheduling strategy that enables differential modeling and progressive training of easy versus hard samples. Evaluated under two long-tailed settings on the ISIC dataset, our method significantly outperforms existing state-of-the-art approaches, establishing new performance benchmarks for long-tailed skin lesion classification. This work is the first to integrate difficulty-aware sampling with prototype-guided curriculum learning, markedly enhancing discrimination for rare classes and overall generalization.
📝 Abstract
Most of the medical tasks naturally exhibit a long-tailed distribution due to the complex patient-level conditions and the existence of rare diseases. Existing long-tailed learning methods usually treat each class equally to re-balance the long-tailed distribution. However, considering that some challenging classes may present diverse intra-class distributions, re-balancing all classes equally may lead to a significant performance drop. To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task. Specifically, we initially sample a subset of training data as anchor points based on the individual class prototypes. Then, these anchor points are used to pre-train an inference model to evaluate the per-class learning difficulty. Finally, we use a curriculum sampling module to dynamically query new samples from the rest training samples with the learning difficulty-aware sampling probability. We evaluated our model against several state-of-the-art methods on the ISIC dataset. The results with two long-tailed settings have demonstrated the superiority of our proposed training strategy, which achieves a new benchmark for long-tailed skin lesion classification.
Problem

Research questions and friction points this paper is trying to address.

Addresses long-tailed distribution in skin lesion classification
Proposes flexible sampling to handle diverse intra-class distributions
Improves performance by dynamically adjusting sample selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Curriculum learning-based framework for classification
Dynamic sampling with learning difficulty-aware probability
Anchor points from class prototypes for pre-training
Lie Ju
Lie Ju
University College London; Moorfields Eye Hospital; Monash University
Computer VisionMedical Image AnalysisOphthalmology
Y
Yicheng Wu
Faculty of Information Technology, Monash University, Melbourne, Australia
L
Lin Wang
Monash-Airdoc Research, Monash University, Melbourne, Australia
Zhenjun Yu
Zhenjun Yu
Shanghai Jiao Tong University, MVIG
RoboticsTactile Sensing3D Vision3D Reconstruction
X
Xin Zhao
Monash-Airdoc Research, Monash University, Melbourne, Australia
X
Xin Wang
Monash-Airdoc Research, Monash University, Melbourne, Australia
P
Paul Bonnington
Monash Medical AI Group, Monash University, Melbourne, Australia
Z
Z. Ge
Monash-Airdoc Research, Monash University, Melbourne, Australia; Monash Medical AI Group, Monash University, Melbourne, Australia