Adversarial Patch Generation for Visual-Infrared Dense Prediction Tasks via Joint Position-Color Optimization

๐Ÿ“… 2026-02-27
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing adversarial patch methods for visual-infrared dense prediction tasks suffer from weak attack efficacy and poor stealthiness due to their neglect of cross-spectral inconsistencies. This work proposes AP-PCO, a novel framework that introduces, for the first time, a joint position-color optimization mechanism to simultaneously perturb both visual and infrared modalities under a black-box setting without requiring internal model information. The method leverages a model-output-driven fitness function for optimization and incorporates a cross-modal color adaptation strategy to effectively reduce patch saliency in both spectral domains. Extensive experiments demonstrate that AP-PCO achieves highly effective, stealthy, and generalizable attacks across various visual-infrared dense prediction models, establishing a new benchmark for evaluating the robustness of multimodal perception systems.

Technology Category

Application Category

๐Ÿ“ Abstract
Multimodal adversarial attacks for dense prediction remain largely underexplored. In particular, visual-infrared (VI) perception systems introduce unique challenges due to heterogeneous spectral characteristics and modality-specific intensity distributions. Existing adversarial patch methods are primarily designed for single-modal inputs and fail to account for crossspectral inconsistencies, leading to reduced attack effectiveness and poor stealthiness when applied to VI dense prediction models. To address these challenges, we propose a joint position-color optimization framework (AP-PCO) for generating adversarial patches in visual-infrared settings. The proposed method optimizes patch placement and color composition simultaneously using a fitness function derived from model outputs, enabling a single patch to perturb both visible and infrared modalities. To further bridge spectral discrepancies, we introduce a crossmodal color adaptation strategy that constrains patch appearance according to infrared grayscale characteristics while maintaining strong perturbations in the visible domain, thereby reducing cross-spectral saliency. The optimization procedure operates without requiring internal model information, supporting flexible black-box attacks. Extensive experiments on visual-infrared dense prediction tasks demonstrate that the proposed AP-PCO achieves consistently strong attack performance across multiple architectures, providing a practical benchmark for robustness evaluation in VI perception systems.
Problem

Research questions and friction points this paper is trying to address.

adversarial patch
visual-infrared
dense prediction
multimodal attack
cross-spectral inconsistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

adversarial patch
visual-infrared perception
joint position-color optimization
crossmodal color adaptation
black-box attack
๐Ÿ”Ž Similar Papers
He Li
He Li
Yanshan University
่ฎก็ฎ—ๆœบ่ง†่ง‰ใ€ๆจกๅผ่ฏ†ๅˆซ
W
Wenyue He
School of Artificial Intelligence (School of Software), Yanshan University, Qinhuangdao 066004, China
W
Weihang Kong
School of Artificial Intelligence (School of Software), Yanshan University, Qinhuangdao 066004, China
Xingchen Zhang
Xingchen Zhang
Senior Lecturer and Director of the Fusion Intelligence Lab, University of Exeter
Fusion IntelligenceHuman-centered AIEmbodied AIPrivacy-preserving AIMedical AI