DSPE: An Energy-Efficient Edge Processor for DeepSeek Inference with MerkleTree-based Incremental Pruning, Multi-Stage Boothing Lookup and Dynamic Adaptive Posit Processing

📅 2026-05-08

📈 Citations: 0

✨ Influential: 0

career value

204K/year

🤖 AI Summary

Deploying large DeepSeek models on edge devices faces significant challenges due to high computational overhead and energy consumption. This work proposes DSPE, a dedicated inference processor for edge deployment, which achieves substantial improvements in energy efficiency and accuracy through three synergistic innovations: Merkle Tree–based incremental pruning for secure sparse computation, multi-level Boothing lookup for fault-tolerant approximate multiplication, and a dynamic adaptive Posit number format with a tailored hardware multiplier architecture. Implemented in TSMC 28nm CMOS technology, DSPE attains an energy efficiency of 109.4 TFLOPS/W, markedly outperforming existing solutions and providing a scalable hardware foundation for efficient and secure edge deployment of large language models.

📝 Abstract

In recent years, DeepSeek has achieved strong inference performance but remains hard to deploy on energy-constrained edge devices. This paper presents the DeepSeek Processing Element (DSPE), an edge-oriented architecture that alleviates the model's heavy computational and energy demands. DSPE introduces three techniques: the MerkleTree-based Incremental Pruning Scheme (MIPS) for secure redundant-vector reduction, the Multi-Stage Boothing Lookup Method (MBLM) for bit-flip-aware approximate multiplication, and the Dynamic Adaptive Posit Processing Mechanism (DAPPM), which introduces a new DA-Posit format and its corresponding hardware multiplication architecture. Implemented in TSMC 28nm CMOS, DSPE achieves 109.4 TFLOPS/W energy efficiency compared with state-of-the-art designs and offers a scalable foundation for edge deployment.

Problem

Research questions and friction points this paper is trying to address.

DeepSeek

edge computing

energy efficiency

model deployment

inference

Innovation

Methods, ideas, or system contributions that make the work stand out.

MerkleTree-based Incremental Pruning

Multi-Stage Boothing Lookup

Dynamic Adaptive Posit