Flexible Sampling for Long-tailed Skin Lesion Classification

📅 2022-04-07

🏛️ International Conference on Medical Image Computing and Computer-Assisted Intervention

📈 Citations: 9

✨ Influential: 0

career value

203K/year

🤖 AI Summary

Skin lesion classification inherently exhibits a long-tailed distribution due to inter-subject variability and the rarity of certain diseases. Conventional long-tailed learning methods apply uniform class-level resampling, ignoring intra-class difficulty heterogeneity—thereby compromising discriminative capability. To address this, we propose a curriculum learning-driven dynamic sampling framework. It introduces a learnability-aware soft sampling mechanism anchored on class prototypes for difficulty estimation, coupled with a dynamic curriculum scheduling strategy that enables differential modeling and progressive training of easy versus hard samples. Evaluated under two long-tailed settings on the ISIC dataset, our method significantly outperforms existing state-of-the-art approaches, establishing new performance benchmarks for long-tailed skin lesion classification. This work is the first to integrate difficulty-aware sampling with prototype-guided curriculum learning, markedly enhancing discrimination for rare classes and overall generalization.

📝 Abstract

Most of the medical tasks naturally exhibit a long-tailed distribution due to the complex patient-level conditions and the existence of rare diseases. Existing long-tailed learning methods usually treat each class equally to re-balance the long-tailed distribution. However, considering that some challenging classes may present diverse intra-class distributions, re-balancing all classes equally may lead to a significant performance drop. To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task. Specifically, we initially sample a subset of training data as anchor points based on the individual class prototypes. Then, these anchor points are used to pre-train an inference model to evaluate the per-class learning difficulty. Finally, we use a curriculum sampling module to dynamically query new samples from the rest training samples with the learning difficulty-aware sampling probability. We evaluated our model against several state-of-the-art methods on the ISIC dataset. The results with two long-tailed settings have demonstrated the superiority of our proposed training strategy, which achieves a new benchmark for long-tailed skin lesion classification.

Problem

Research questions and friction points this paper is trying to address.

Addresses long-tailed distribution in skin lesion classification

Proposes flexible sampling to handle diverse intra-class distributions

Improves performance by dynamically adjusting sample selection

Innovation

Methods, ideas, or system contributions that make the work stand out.

Curriculum learning-based framework for classification

Dynamic sampling with learning difficulty-aware probability

Anchor points from class prototypes for pre-training

🔎 Similar Papers

Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution