DSPE: An Energy-Efficient Edge Processor for DeepSeek Inference with MerkleTree-based Incremental Pruning, Multi-Stage Boothing Lookup and Dynamic Adaptive Posit Processing

๐Ÿ“… 2026-05-08
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

229K/year
๐Ÿค– AI Summary
Deploying large DeepSeek models on edge devices faces significant challenges due to high computational overhead and energy consumption. This work proposes DSPE, a dedicated inference processor for edge deployment, which achieves substantial improvements in energy efficiency and accuracy through three synergistic innovations: Merkle Treeโ€“based incremental pruning for secure sparse computation, multi-level Boothing lookup for fault-tolerant approximate multiplication, and a dynamic adaptive Posit number format with a tailored hardware multiplier architecture. Implemented in TSMC 28nm CMOS technology, DSPE attains an energy efficiency of 109.4 TFLOPS/W, markedly outperforming existing solutions and providing a scalable hardware foundation for efficient and secure edge deployment of large language models.
๐Ÿ“ Abstract
In recent years, DeepSeek has achieved strong inference performance but remains hard to deploy on energy-constrained edge devices. This paper presents the DeepSeek Processing Element (DSPE), an edge-oriented architecture that alleviates the model's heavy computational and energy demands. DSPE introduces three techniques: the MerkleTree-based Incremental Pruning Scheme (MIPS) for secure redundant-vector reduction, the Multi-Stage Boothing Lookup Method (MBLM) for bit-flip-aware approximate multiplication, and the Dynamic Adaptive Posit Processing Mechanism (DAPPM), which introduces a new DA-Posit format and its corresponding hardware multiplication architecture. Implemented in TSMC 28nm CMOS, DSPE achieves 109.4 TFLOPS/W energy efficiency compared with state-of-the-art designs and offers a scalable foundation for edge deployment.
Problem

Research questions and friction points this paper is trying to address.

DeepSeek
edge computing
energy efficiency
model deployment
inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

MerkleTree-based Incremental Pruning
Multi-Stage Boothing Lookup
Dynamic Adaptive Posit
Energy-Efficient Edge Processor
DeepSeek Inference
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yuhan Zhang
School of Computer Science and Engineering, Northeastern University, Shenyang, China
Zhou Wang
Zhou Wang
National University of Singapore
MathemticsStatistics and Probability
Z
Zhou Shu
School of Microelectronics, Xidian University, Xiโ€™an, China; Hangzhou Institute of Technology, Xidian University, Hangzhou, China
J
Jiuren Zhou
School of Microelectronics, Xidian University, Xiโ€™an, China; Hangzhou Institute of Technology, Xidian University, Hangzhou, China
Y
Yanqing Xu
The Chinese University of Hong Kong, Shenzhen, Shenzhen, China
X
Xiaonan Tang
Wisemaytech Co., Ltd., Beijing, China
S
Shushan Qiao
Institute of Microelectronics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
T
Tianchun Ye
Institute of Microelectronics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Yang Liu
Yang Liu
Nanyang Technological University
AgentSoftware EngineeringCyber SecurityTrustworthy AISoftware Security
A
Anil A. Bharath
Imperial College London, London, United Kingdom; Imperial Global Singapore, Singapore, Singapore
E
Emm Mic Drakakis
Imperial College London, London, United Kingdom; Imperial Global Singapore, Singapore, Singapore