Quantization-Aware Neuromorphic Architecture for Efficient Skin Disease Classification on Resource-Constrained Devices

📅 2025-07-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of resource constraints, high energy consumption, and privacy sensitivity in skin lesion classification on edge devices, this paper proposes QANA—a quantization-aware spiking neural network. QANA integrates Ghost modules for parameter efficiency, Squeeze-and-Excitation channel-wise attention for discriminative feature enhancement, and a spike-compatible quantization-aware training mechanism enabling end-to-end optimization and efficient deployment on neuromorphic hardware (e.g., BrainChip Akida). Evaluated on HAM10000 and a clinical dataset, QANA achieves 91.6% Top-1 accuracy and 82.4% macro-F1 score, respectively. On-device deployment yields 1.5 ms inference latency and 1.7 mJ per inference—reducing energy consumption by over 98.6% compared to GPU-based inference. To our knowledge, QANA is the first framework to jointly optimize accuracy, energy efficiency, and hardware deployability for edge dermatology diagnosis, establishing a new paradigm for lightweight, privacy-preserving skin lesion analysis.

Technology Category

Application Category

📝 Abstract
Accurate and efficient skin lesion classification on edge devices is critical for accessible dermatological care but remains challenging due to computational, energy, and privacy constraints. We introduce QANA, a novel quantization-aware neuromorphic architecture for incremental skin lesion classification on resource-limited hardware. QANA effectively integrates ghost modules, efficient channel attention, and squeeze-and-excitation blocks for robust feature representation with low-latency and energy-efficient inference. Its quantization-aware head and spike-compatible transformations enable seamless conversion to spiking neural networks (SNNs) and deployment on neuromorphic platforms. Evaluation on the large-scale HAM10000 benchmark and a real-world clinical dataset shows that QANA achieves 91.6% Top-1 accuracy and 82.4% macro F1 on HAM10000, and 90.8% / 81.7% on the clinical dataset, significantly outperforming state-of-the-art CNN-to-SNN models under fair comparison. Deployed on BrainChip Akida hardware, QANA achieves 1.5,ms inference latency and 1.7,mJ energy per image, reducing inference latency and energy use by over 94.6%/98.6% compared to GPU-based CNNs surpassing state-of-the-art CNN-to-SNN conversion baselines. These results demonstrate the effectiveness of QANA for accurate, real-time, and privacy-sensitive medical analysis in edge environments.
Problem

Research questions and friction points this paper is trying to address.

Efficient skin disease classification on edge devices
Low-latency, energy-efficient inference for resource-limited hardware
Seamless conversion to spiking neural networks for deployment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantization-aware neuromorphic architecture for edge devices
Integrates ghost modules and efficient attention blocks
Enables seamless conversion to spiking neural networks
🔎 Similar Papers
No similar papers found.
Haitian Wang
Haitian Wang
University of Western Australia
3D point cloudComputer visionMachine leaningIoTRemote sensing
X
Xinyu Wang
The University of Western Australia, Perth, WA 6009, Australia
Yiren Wang
Yiren Wang
Microsoft Research
Machine TranslationNatural Language ProcessingMachine Learning
K
Karen Lee
The University of Western Australia, Perth, WA 6009, Australia
Z
Zichen Geng
The University of Western Australia, Perth, WA 6009, Australia
X
Xian Zhang
The University of Western Australia, Perth, WA 6009, Australia
K
Kehkashan Kiran
Northwestern Polytechnical University, Xi’an, Shaanxi 710129, China
Y
Yu Zhang
Northwestern Polytechnical University, Xi’an, Shaanxi 710129, China
B
Bo Miao
Australian Institute for Machine Learning, University of Adelaide, SA 5005, Australia