FPGA-Optimized Hardware Accelerator for Fast Fourier Transform and Singular Value Decomposition in AI

📅 2025-04-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address high latency and low energy efficiency in FFT and SVD computations within AI models, this paper proposes a reconfigurable hardware acceleration architecture tailored for Xilinx FPGAs. The architecture introduces a novel integration of dataflow control, watermark embedding, and tightly coupled FFT/SVD compute units enabling dynamic reconfiguration. It employs pipelined CORDIC arithmetic, customized memory access patterns, and parallelized SVD decomposition to jointly optimize throughput, security, and robustness. Experimental evaluation demonstrates 12.3× and 9.6× speedups for FFT and SVD, respectively, over CPU/GPU software implementations, along with an 8.2× improvement in energy efficiency. This work establishes an efficient, secure, and scalable hardware acceleration paradigm for frequency-domain and matrix-decomposition-intensive workloads in AI systems.

Technology Category

Application Category

📝 Abstract
This research introduces an FPGA-based hardware accelerator to optimize the Singular Value Decomposition (SVD) and Fast Fourier transform (FFT) operations in AI models. The proposed design aims to improve processing speed and reduce computational latency. Through experiments, we validate the performance benefits of the hardware accelerator and show how well it handles FFT and SVD operations. With its strong security and durability, the accelerator design achieves significant speedups over software implementations, thanks to its modules for data flow control, watermark embedding, FFT, and SVD.
Problem

Research questions and friction points this paper is trying to address.

Optimizing SVD and FFT operations in AI models
Improving processing speed and reducing computational latency
Validating performance benefits of FPGA-based hardware accelerator
Innovation

Methods, ideas, or system contributions that make the work stand out.

FPGA-based hardware accelerator
Optimizes FFT and SVD operations
Enhances speed and reduces latency
🔎 Similar Papers
No similar papers found.
Hong Ding
Hong Ding
Tsung-Dao Lee Institute, Shanghai Jiao Tong University
condensed matter physics
Chia Chao Kang
Chia Chao Kang
Xiamen University Malaysia
Wireless CommunicationRenewable EnergyIoTHealthcare
S
Suyang Xi
School of Electrical Engineering and Artificial Intelligence, Xiamen University Malaysia, Sepang, Malaysia
Z
Zehang Liu
School of Computer and Data Science, Xiamen University Malaysia, Sepang, Malaysia
X
Xuan Zhang
School of Computer and Data Science, Xiamen University Malaysia, Sepang, Malaysia
Y
Yi Ding
School of Economics and Management, Xiamen University Malaysia, Sepang, Malaysia