Dissociating spatial frequency reliance from adversarial robustness advantages in neurally guided deep convolutional neural networks

📅 2026-05-05
📈 Citations: 0
Influential: 0
📄 PDF

career value

229K/year
📝 Abstract
Deep convolutional neural networks (DCNNs) have rivaled humans on many visual tasks, yet they remain vulnerable to near-imperceptible perturbations generated by adversarial attacks. Recent work shows that aligning DCNN representations with human visual cortex activity improves adversarial robustness, but the mechanisms driving this advantage are unclear. One hypothesis suggests that neural alignment confers robustness by biasing models away from brittle high-frequency details and towards the low spatial frequencies (LSF). However, recent work shows that human object recognition critically depends on a narrow, mid-frequency "human channel". Interestingly, this band was partially preserved in prior LSF-focused studies. Here, we investigate whether a spectral bias towards the LSF or the human channel is the primary driver of the adversarial robustness observed in neurally aligned DCNNs. We first show that DCNNs aligned to higher-order regions of the human ventral visual stream systematically increase reliance on both LSF and the human channel. However, directly steering DCNNs towards these bands revealed a clear dissociation. Biasing models towards the human channel, either alone or together with LSF, does not improve robustness and even impairs it. LSF bias produced some robustness gains, but such improvements are modest despite inducing much larger shifts in spatial-frequency reliance than neurally aligned models. Spatial-frequency-biased models overall show little, if any, increase in similarity to human neural representational geometry. Together, our results suggest that altered spatial-frequency reliance is likely an emergent property of learning more human-like representations rather than the primary mechanism by which neural alignment confers adversarial robustness, and motivate the need for future research examining representational properties beyond spatial-frequency profiles.
Problem

Research questions and friction points this paper is trying to address.

adversarial robustness
spatial frequency
neural alignment
human visual cortex
deep convolutional neural networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

neural alignment
adversarial robustness
spatial frequency
human visual cortex
representational geometry
Z
Zhenan Shao
Department of Psychology, University of Illinois Urbana-Champaign; The Beckman Institute, University of Illinois Urbana-Champaign; Department of Cognitive Science, Johns Hopkins University
Tianyu Ren
Tianyu Ren
The University of Manchester
Reinforcement LearningMulti-AgentEvolutionary DynamicGame Theory
C
Chengxiao Wang
Department of Computer Science, University of Illinois Urbana-Champaign
Leyla Isik
Leyla Isik
Johns Hopkins University
Cognitive neuroscienceComputational neuroscienceVisionMachine LearningSocial perception
D
Diane M. Beck
Department of Psychology, University of Illinois Urbana-Champaign; The Beckman Institute, University of Illinois Urbana-Champaign