Joint Optimization of Model Partitioning and Resource Allocation for Anti-Jamming Collaborative Inference Systems

📅 2026-03-02
🤖 AI Summary
This work addresses the vulnerability of intermediate features to malicious jamming during wireless transmission in device-edge collaborative deep inference on resource-constrained devices, which degrades both inference accuracy and latency. To mitigate this, the paper incorporates anti-jamming objectives into a device-edge collaborative inference system and proposes a joint optimization framework that maximizes system utility, defined as the revenue of delay and accuracy (RDA), a trade-off between inference accuracy and delay. The framework jointly optimizes DNN partitioning, computation resource allocation, and device transmit power. To solve the resulting mixed-integer nonlinear programming problem, an alternating optimization algorithm decomposes it into three subproblems that are solved via Karush-Kuhn-Tucker (KKT) conditions, convex optimization, and a quantum genetic algorithm, respectively. Simulations show that the proposed scheme outperforms the baselines in terms of RDA.

📝 Abstract
With the increasing computational demands of deep neural network (DNN) inference on resource-constrained devices, DNN partitioning-based device-edge collaborative inference has emerged as a promising paradigm. However, the transmission of intermediate feature data is vulnerable to malicious jamming, which significantly degrades the overall inference performance. To counter this threat, this letter focuses on an anti-jamming collaborative inference system in the presence of a malicious jammer. In this system, a DNN model is partitioned into two distinct segments, which are executed by wireless devices and edge servers, respectively. We first analyze the effects of jamming and DNN partitioning on inference accuracy via data regression. Based on this, our objective is to maximize the system's revenue of delay and accuracy (RDA) under inference accuracy and computing resource constraints by jointly optimizing computation resource allocation, devices' transmit power, and DNN partitioning. To address the mixed-integer nonlinear programming problem, we propose an efficient alternating optimization-based algorithm, which decomposes the problem into three subproblems that are solved via Karush-Kuhn-Tucker conditions, convex optimization methods, and a quantum genetic algorithm, respectively. Extensive simulations demonstrate that our proposed scheme outperforms baselines in terms of RDA.
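The alternating structure described in the abstract can be illustrated with a small toy sketch. Everything in it is an assumption made for illustration and not the paper's model: the layer loads, feature sizes, channel gains, the saturating accuracy-versus-SINR curve, the energy penalty, and the utility weights are all hypothetical. The edge-compute step uses the closed form that KKT conditions give for minimizing a sum of load/frequency terms under a shared budget; the power step uses a plain grid search in place of convex optimization; and the partitioning step uses exhaustive enumeration in place of the quantum genetic algorithm.

```python
import math

# All constants and model forms are illustrative assumptions, not from the paper.
LAYER_CYCLES = [2e8, 3e8, 4e8, 1e8]   # compute load of each DNN layer (cycles)
FEATURE_BITS = [6e6, 4e6, 2e6, 1e6]   # bits sent if the first k layers run on the device
F_DEVICE = 1e9        # device CPU frequency (cycles/s)
F_EDGE_TOTAL = 1e10   # edge CPU budget shared by all devices (cycles/s)
BANDWIDTH = 5e6       # Hz
NOISE = 1e-9          # receiver noise power (W)
JAM_POWER_RX = 2e-8   # received jamming power (W)
P_MAX = 0.5           # maximum transmit power (W)
ACC_WEIGHT = 2.0      # hypothetical weight on accuracy
ENERGY_WEIGHT = 0.5   # hypothetical penalty per watt of transmit power


def sinr(p, gain):
    # The jammer is treated as extra interference at the edge receiver.
    return p * gain / (JAM_POWER_RX + NOISE)


def accuracy(p, gain):
    # Hypothetical saturating curve; the paper fits its own model by regression.
    return 0.95 * (1 - math.exp(-0.5 * sinr(p, gain)))


def delay(k, p, gain, f_edge):
    local = sum(LAYER_CYCLES[:k]) / F_DEVICE
    tx = FEATURE_BITS[k] / (BANDWIDTH * math.log2(1 + sinr(p, gain)))
    edge = sum(LAYER_CYCLES[k:]) / f_edge
    return local + tx + edge


def utility(k, p, gain, f_edge):
    return ACC_WEIGHT * accuracy(p, gain) - delay(k, p, gain, f_edge) - ENERGY_WEIGHT * p


def edge_allocation(ks):
    # KKT closed form for min sum(C_n / f_n) s.t. sum(f_n) = F: f_n proportional to sqrt(C_n).
    roots = [math.sqrt(sum(LAYER_CYCLES[k:])) for k in ks]
    return [F_EDGE_TOTAL * r / sum(roots) for r in roots]


def alternate(gains, iters=10):
    n = len(gains)
    ks, ps = [0] * n, [P_MAX] * n
    grid = [P_MAX * i / 50 for i in range(1, 51)]  # includes P_MAX, excludes 0
    history = []
    for _ in range(iters):
        fs = edge_allocation(ks)  # step 1: edge compute allocation
        ps = [max(grid, key=lambda p: utility(ks[i], p, gains[i], fs[i]))
              for i in range(n)]  # step 2: transmit power (grid-search stand-in)
        ks = [max(range(len(LAYER_CYCLES)),
                  key=lambda k: utility(k, ps[i], gains[i], fs[i]))
              for i in range(n)]  # step 3: partition point (enumeration stand-in)
        history.append(sum(utility(ks[i], ps[i], gains[i], fs[i]) for i in range(n)))
    return ks, ps, fs, history


if __name__ == "__main__":
    ks, ps, fs, history = alternate([1e-7, 4e-8])
    print("partition points:", ks)
    print("transmit powers:", ps)
    print("utility per iteration:", history)
```

Because each step maximizes the total utility over one block of variables while the others are held fixed, the recorded utility never decreases from one iteration to the next in this toy setting, which is the property that makes alternating schemes of this kind attractive.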
Problem

Research questions and friction points this paper is trying to address.

anti-jamming
collaborative inference
DNN partitioning
resource allocation
jamming attack
Innovation

Methods, ideas, or system contributions that make the work stand out.

DNN partitioning
anti-jamming
collaborative inference
resource allocation
quantum genetic algorithm
Mengru Wu
College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Jiawei Li
College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Jiaqi Wei
PhD student, Zhejiang University
NLP, LLM, AI for Science
Bin Lyu
School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
Kai-Kit Wong
Department of Electronic and Electrical Engineering, University College London, WC1E 7JE London, U.K., and also with the Department of Electronic Engineering, Kyung Hee University, Yongin-si, Gyeonggi-do 17104, Republic of Korea
Hyundong Shin
Professor, Department of Electronic Engineering, Kyung Hee University
Quantum Information Science, Wireless Communication, Machine Intelligence