Towards Secure and Efficient DNN Accelerators via Hardware-Software Co-Design

📅 2026-02-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the significant performance and energy efficiency degradation in existing DNN accelerators caused by security mechanisms that incur high hardware overhead and frequent off-chip memory accesses. To overcome these limitations, the authors propose a hardware-software co-designed secure memory protection framework that introduces a novel bandwidth-aware dynamic encryption granularity strategy. This approach leverages sliding-window overlap analysis to eliminate redundant memory accesses induced by intra- and inter-layer tiling, and incorporates a multi-level memory authentication mechanism to minimize unnecessary off-chip communication. Experimental evaluations on both server-class and edge NPUs demonstrate that the proposed method achieves strong security guarantees while reducing performance overhead by over 12% and improving energy efficiency by up to 87%, with excellent scalability across diverse architectures.

Technology Category

Application Category

📝 Abstract
The rapid deployment of deep neural network (DNN) accelerators in safety-critical domains such as autonomous vehicles, healthcare systems, and financial infrastructure necessitates robust mechanisms to safeguard data confidentiality and computational integrity. Existing security solutions for DNN accelerators, however, suffer from excessive hardware resource demands and frequent off-chip memory access overheads, which degrade performance and scalability. To address these challenges, this paper presents a secure and efficient memory protection framework for DNN accelerators with minimal overhead. First, we propose a bandwidth-aware cryptographic scheme that adapts encryption granularity based on memory traffic patterns, striking a balance between security and resource efficiency. Second, we observe that both the overlapping regions in the intra-layer tiling's sliding window pattern and those resulting from inter-layer tiling strategy discrepancies introduce substantial redundant memory accesses and repeated computational overhead in cryptography. Third, we introduce a multi-level authentication mechanism that effectively eliminates unnecessary off-chip memory accesses, enhancing performance and energy efficiency. Experimental results show that this work decreases performance overhead by over 12% and achieves 87% energy efficiency improvement for both server and edge neural processing units (NPUs), while ensuring robust scalability.
Problem

Research questions and friction points this paper is trying to address.

DNN accelerators
data confidentiality
computational integrity
memory access overhead
hardware resource overhead
Innovation

Methods, ideas, or system contributions that make the work stand out.

hardware-software co-design
bandwidth-aware cryptography
memory access redundancy reduction
multi-level authentication
DNN accelerator security
🔎 Similar Papers
No similar papers found.
W
Wei Xuan
ACCESS – AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park and the Hong Kong University of Science and Technology, Hong Kong, China
Z
Zihao Xuan
ACCESS – AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park and the Hong Kong University of Science and Technology, Hong Kong, China
R
Rongliang Fu
The Chinese University of Hong Kong, Hong Kong 999077, China
Ning Lin
Ning Lin
Princeton University
HurricanesStorm SurgeClimate AdaptationCoastal ResilienceRisk Analysis
Kwunhang Wong
Kwunhang Wong
The University of Hong Kong
Differential PrivacyHardware Security
Z
Zikang Yuan
ACCESS – AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park and the Hong Kong University of Science and Technology, Hong Kong, China
Lang Feng
Lang Feng
Nanyang Technological University
Reinforcement Learning
Zhongrui Wang
Zhongrui Wang
Southern University of Science and Technology
MemristorIn-memory ComputingAI accelerator
Tsung-Yi Ho
Tsung-Yi Ho
Chinese University of Hong Kong
Electronic Design AutomationMicrofluidicsTrustworthy Machine Learning
Y
Yuzhong Jiao
ACCESS – AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park and the Hong Kong University of Science and Technology, Hong Kong, China
L
Luhong Liang
ACCESS – AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park and the Hong Kong University of Science and Technology, Hong Kong, China