DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation

📅 2026-02-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of model bias in skin lesion classification caused by the scarcity of malignant samples. To mitigate class imbalance, the authors propose a novel framework that integrates class-conditional diffusion-based image generation, masked autoencoder (MAE) self-supervised pretraining, and knowledge distillation. High-quality synthetic images of malignant lesions are first generated using a conditional diffusion model. A Vision Transformer (ViT) is then pretrained via MAE on both real and synthetic data to learn robust representations. Finally, knowledge distillation transfers the learned capabilities from the large ViT to a lightweight student network, enabling high classification performance with efficient mobile deployment. This study presents the first systematic integration of generative modeling, self-supervised learning, and model compression, offering an effective solution for few-shot medical image classification.

Technology Category

Application Category

📝 Abstract
Skin lesion classification datasets often suffer from severe class imbalance, with malignant cases significantly underrepresented, leading to biased decision boundaries during deep learning training. We address this challenge using class-conditioned diffusion models to generate synthetic dermatological images, followed by self-supervised MAE pretraining to enable huge ViT models to learn robust, domain-relevant features. To support deployment in practical clinical settings, where lightweight models are required, we apply knowledge distillation to transfer these representations to a smaller ViT student suitable for mobile devices. Our results show that MAE pretraining on synthetic data, combined with distillation, improves classification performance while enabling efficient on-device inference for practical clinical use.
Problem

Research questions and friction points this paper is trying to address.

class imbalance
skin lesion classification
malignant underrepresentation
biased decision boundaries
Innovation

Methods, ideas, or system contributions that make the work stand out.

class-conditioned diffusion
MAE pretraining
knowledge distillation
skin lesion classification
lightweight ViT
🔎 Similar Papers
No similar papers found.
F
Francisco Filho
Centro de Informática, Universidade Federal de Pernambuco, Brazil
K
Kelvin Cunha
Centro de Informática, Universidade Federal de Pernambuco, Brazil
F
Fábio Papais
Centro de Informática, Universidade Federal de Pernambuco, Brazil
E
Emanoel dos Santos
Centro de Informática, Universidade Federal de Pernambuco, Brazil
R
Rodrigo Mota
Centro de Informática, Universidade Federal de Pernambuco, Brazil
T
Thales Bezerra
Centro de Informática, Universidade Federal de Pernambuco, Brazil
E
Erico Medeiros
Centro de Informática, Universidade Federal de Pernambuco, Brazil
Paulo Borba
Paulo Borba
Federal University of Pernambuco
Software EngineeringProgramming Languages
Tsang Ing Ren
Tsang Ing Ren
Center for Informatics - CIn, Federal University of Pernambuco - UFPE
Image ProcessingComputer VisionPattern RecognitionMachine Learning