🤖 AI Summary
Skin lesion classification models exhibit fairness bias due to continuous variation in skin tone, which traditional discrete subgroup-based fairness methods fail to capture adequately.
Method: We propose a distribution-aware reweighting framework that models skin tone as a continuous variable, estimates its density via kernel density estimation (KDE), and introduces a distance-aware loss function that dynamically assigns sample weights based on local density and neighborhood distance in the tone distribution.
Contribution/Results: This work is the first to formally define and evaluate individual-level fairness from a continuous distribution perspective, overcoming limitations of discrete subgroup partitioning that ignore intra-group heterogeneity. Extensive experiments employ 12 statistical divergence measures—including Wasserstein distance and Hellinger distance—along with Fidelity Similarity for distribution-aware weighting. The method consistently improves classification accuracy for underrepresented skin tones and enhances overall fairness across both CNN and Transformer architectures, outperforming existing reweighting baselines.
📝 Abstract
Skin color has historically been a focal point of discrimination, yet fairness research in machine learning for medical imaging often relies on coarse subgroup categories, overlooking individual-level variations. Such group-based approaches risk obscuring biases faced by outliers within subgroups. This study introduces a distribution-based framework for evaluating and mitigating individual fairness in skin lesion classification. We treat skin tone as a continuous attribute rather than a categorical label, and employ kernel density estimation (KDE) to model its distribution. We further compare twelve statistical distance metrics to quantify disparities between skin tone distributions and propose a distance-based reweighting (DRW) loss function to correct underrepresentation in minority tones. Experiments across CNN and Transformer models demonstrate: (i) the limitations of categorical reweighting in capturing individual-level disparities, and (ii) the superior performance of distribution-based reweighting, particularly with Fidelity Similarity (FS), Wasserstein Distance (WD), Hellinger Metric (HM), and Harmonic Mean Similarity (HS). These findings establish a robust methodology for advancing fairness at individual level in dermatological AI systems, and highlight broader implications for sensitive continuous attributes in medical image analysis.