ZOTTA: Test-Time Adaptation with Gradient-Free Zeroth-Order Optimization

📅 2026-03-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing test-time adaptation methods, which rely on backpropagation, incur high computational overhead, and are incompatible with non-differentiable models such as quantized networks, hindering deployment on edge devices. To overcome these challenges, the authors propose ZOTTA, a backpropagation-free test-time adaptation framework that leverages zeroth-order optimization (ZOO) using only forward passes for efficient adaptation. The key innovations include distributionally robust layer selection and spatial feature aggregation alignment, which jointly reduce the optimization dimensionality and enhance stability, enabling architecture-agnostic adaptation. Experimental results demonstrate that ZOTTA matches or surpasses gradient-based methods on ImageNet-C, ImageNet-R, ImageNet-Sketch, and ImageNet-A, while reducing memory consumption by 84% and improving accuracy by 3.9% on ImageNet-C.

Technology Category

Application Category

📝 Abstract
Test-time adaptation (TTA) aims to improve model robustness under distribution shifts by adapting to unlabeled test data, but most existing methods rely on backpropagation (BP), which is computationally costly and incompatible with non-differentiable models such as quantized models, limiting practical deployment on numerous edge devices. Recent BP-free approaches alleviate overhead but remain either architecture-specific or limited in optimization capacity to handle high-dimensional models. We propose ZOTTA, a fully BP-free TTA framework that performs efficient adaptation using only forward passes via Zeroth-Order Optimization (ZOO). While ZOO is theoretically appealing, naive application leads to slow convergence under high-dimensional parameter spaces and unstable optimization due to the lack of labels. ZOTTA overcomes these challenges through 1) Distribution-Robust Layer Selection, which automatically identifies and freezes layers that already extract distribution-invariant features, updating only domain-sensitive layers to reduce the optimization dimensionality and accelerate convergence; 2) Spatial Feature Aggregation Alignment, which stabilizes ZOO by aligning globally aggregated spatial features between source and target to reduce gradient variance. Together, these components enable architecture-agnostic and stable BP-free adaptation. Extensive experiments on ImageNet-C/R/Sketch/A show that ZOTTA outperforms or matches BP-based methods, e.g., it reduces memory usage by 84% and improves accuracy by 3.9% over SAR on ImageNet-C.
Problem

Research questions and friction points this paper is trying to address.

test-time adaptation
backpropagation-free
zeroth-order optimization
distribution shift
edge deployment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Test-Time Adaptation
Zeroth-Order Optimization
Backpropagation-Free
Distribution Shift
Model Robustness
🔎 Similar Papers
No similar papers found.
Ronghao Zhang
Ronghao Zhang
Unknown affiliation
PsycholinguisticsComputaional Linguistics
Shuaicheng Niu
Shuaicheng Niu
Nanyang Technological University
Machine LearningDomain AdaptationRobustnessAutoML
Q
Qi Deng
South China University of Technology, Guangzhou, 510000, China
Yanjie Dong
Yanjie Dong
Associate Professor, Shenzhen MSU-BIT University
Machine learning and optimizationwireless for AI
J
Jian Chen
South China University of Technology, Guangzhou, 510000, China
R
Runhao Zeng
Shenzhen MSU-BIT University, Shenzhen, 518172, China