🤖 AI Summary
Visual tracking models exhibit insufficient robustness against adversarial attacks, leading to significant performance degradation. This paper proposes the first zero-shot, real-time defense framework leveraging a lightweight diffusion prior—requiring no adversarial training—to enhance perturbation resilience. Our method integrates the diffusion prior into a Siamese tracking architecture to perform adversarial purification directly in feature space. We further introduce a noise-aware feature calibration mechanism that jointly optimizes robustness and efficiency, and refine the noise scheduling strategy to minimize computational overhead. Evaluated on LaSOT and GOT-10k benchmarks under diverse adversarial attacks, our approach achieves a 23.6% improvement in mean Average Precision (mAP) while incurring only a 1.8 ms increase in inference latency—substantially outperforming existing defense methods in both accuracy and efficiency.