Mixed-Precision Quantization: Make the Best Use of Bits Where They Matter Most

๐Ÿ“… 2024-12-04
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 1
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses the optimal bit-width allocation problem in mixed-precision quantization. Methodologically, it introduces a novel search-based framework featuring two variants of particle swarm optimization: Penalty-Driven PSO (PPSO) and Greedy-Criterion PSO (GC-PSO), both rigorously modeling integer constraints and proven convergent via dynamical systems theory. The key contribution is the first unified bit-allocation paradigm spanning diverse domainsโ€”digital signal processing (FIR filters), wireless communications (receivers), and machine learning (gradient descent). Experimental results demonstrate that the proposed approach achieves superior trade-offs between quantization accuracy and hardware resource efficiency, significantly outperforming state-of-the-art quantization methods across benchmarks.

Technology Category

Application Category

๐Ÿ“ Abstract
Mixed-precision quantization offers superior performance to fixed-precision quantization. It has been widely used in signal processing, communication systems, and machine learning. In mixed-precision quantization, bit allocation is essential. Hence, in this paper, we propose a new bit allocation framework for mixed-precision quantization from a search perspective. First, we formulate a general bit allocation problem for mixed-precision quantization. Then we introduce the penalized particle swarm optimization (PPSO) algorithm to address the integer consumption constraint. To improve efficiency and avoid iterations on infeasible solutions within the PPSO algorithm, a greedy criterion particle swarm optimization (GC-PSO) algorithm is proposed. The corresponding convergence analysis is derived based on dynamical system theory. Furthermore, we apply the above framework to some specific classic fields, i.e., finite impulse response (FIR) filters, receivers, and gradient descent. Numerical examples in each application underscore the superiority of the proposed framework to the existing algorithms.
Problem

Research questions and friction points this paper is trying to address.

Optimize bit allocation in mixed-precision quantization systems
Develop efficient algorithms for constrained integer bit allocation
Apply framework to FIR filters, receivers, and gradient descent
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes a new bit allocation framework
Introduces penalized particle swarm optimization
Develops greedy criterion PSO algorithm
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yiming Fang
CAS Key Laboratory of Wireless-Optical Communications, University of Science and Technology of China, Hefei 230027, China
L
Li Chen
CAS Key Laboratory of Wireless-Optical Communications, University of Science and Technology of China, Hefei 230027, China
Yunfei Chen
Yunfei Chen
Department of Engineering, University of Durham
Wireless Communications
W
Weidong Wang
CAS Key Laboratory of Wireless-Optical Communications, University of Science and Technology of China, Hefei 230027, China
Changsheng You
Changsheng You
Southern University of Science and Technology, Clarivate Highly Cited Researcher
Edge computing and intelligenceintelligent reflecting surfacenear-field communicationsUAV