Gradient Alignment Improves Test-Time Adaptation for Medical Image Segmentation

📅 2024-08-14
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In medical image segmentation, cross-center domain shift severely degrades the generalization of pre-trained models on unseen hospital data; existing test-time adaptation (TTA) methods suffer from suboptimal fine-tuning due to gradient direction bias and fixed learning rates. To address this, we propose GraTa, a gradient-aligned TTA method. GraTa introduces a novel pseudo-gradient–auxiliary-gradient alignment mechanism to jointly rectify optimization directions, and designs a cosine-similarity-based dynamic learning rate strategy to enhance convergence and robustness. Operating entirely without additional annotations, GraTa integrates self-supervised pseudo-labeling into the TTA framework. Extensive experiments across multiple medical image segmentation benchmarks demonstrate that GraTa consistently outperforms state-of-the-art TTA methods. The source code and pre-trained weights are publicly available.

Technology Category

Application Category

📝 Abstract
Although recent years have witnessed significant advancements in medical image segmentation, the pervasive issue of domain shift among medical images from diverse centres hinders the effective deployment of pre-trained models. Many Test-time Adaptation (TTA) methods have been proposed to address this issue by fine-tuning pre-trained models with test data during inference. These methods, however, often suffer from less-satisfactory optimization due to suboptimal optimization direction (dictated by the gradient) and fixed step-size (predicated on the learning rate). In this paper, we propose the Gradient alignment-based Test-time adaptation (GraTa) method to improve both the gradient direction and learning rate in the optimization procedure. Unlike conventional TTA methods, which primarily optimize the pseudo gradient derived from a self-supervised objective, our method incorporates an auxiliary gradient with the pseudo one to facilitate gradient alignment. Such gradient alignment enables the model to excavate the similarities between different gradients and correct the gradient direction to approximate the empirical gradient related to the current segmentation task. Additionally, we design a dynamic learning rate based on the cosine similarity between the pseudo and auxiliary gradients, thereby empowering the adaptive fine-tuning of pre-trained models on diverse test data. Extensive experiments establish the effectiveness of the proposed gradient alignment and dynamic learning rate and substantiate the superiority of our GraTa method over other state-of-the-art TTA methods on a benchmark medical image segmentation task. The code and weights of pre-trained source models are available at https://github.com/Chen-Ziyang/GraTa.
Problem

Research questions and friction points this paper is trying to address.

Medical Image Segmentation
Domain Adaptation
Fine-tuning
Innovation

Methods, ideas, or system contributions that make the work stand out.

GraTa
Dynamic Learning Rate
Cross-Hospital Adaptation
🔎 Similar Papers
No similar papers found.
Ziyang Chen
Ziyang Chen
Peking University
Quantum key distributionQuantum random number generation
Y
Yiwen Ye
National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science and Engineering, Northwestern Polytechnical University, Xi’an, China
Yongsheng Pan
Yongsheng Pan
Northwestern Polytechnical University
Y
Yong Xia
National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science and Engineering, Northwestern Polytechnical University, Xi’an, China; Research & Development Institute of Northwestern Polytechnical University in Shenzhen, Shenzhen, China; Ningbo Institute of Northwestern Polytechnical University, Ningbo, China