Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual Tracking

📅 2025-05-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Visual tracking models exhibit insufficient robustness against adversarial attacks, leading to significant performance degradation. This paper proposes the first zero-shot, real-time defense framework leveraging a lightweight diffusion prior—requiring no adversarial training—to enhance perturbation resilience. Our method integrates the diffusion prior into a Siamese tracking architecture to perform adversarial purification directly in feature space. We further introduce a noise-aware feature calibration mechanism that jointly optimizes robustness and efficiency, and refine the noise scheduling strategy to minimize computational overhead. Evaluated on LaSOT and GOT-10k benchmarks under diverse adversarial attacks, our approach achieves a 23.6% improvement in mean Average Precision (mAP) while incurring only a 1.8 ms increase in inference latency—substantially outperforming existing defense methods in both accuracy and efficiency.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Improving robustness of visual tracking against adversarial attacks
Proposing diffusion-based defense method for real-time performance
Combining multi-scale losses to suppress adversarial perturbations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Denoise diffusion models for adversarial defense
Multi-scale loss mechanism suppresses perturbations
Real-time robust tracking at 30 FPS
🔎 Similar Papers
No similar papers found.
Long Xu
Long Xu
Ningbo University, Peng Cheng Laboratory
image/signal processingvideo codingespecially rate control of video codingimage/signal
P
Peng Gao
Qufu Normal University
W
Wen-Jia Tang
Qufu Normal University
F
Fei Wang
Harbin Institute of Technology Shenzhen
R
Ru-Yue Yuan