Preserving Domain Generalization in Fine-Tuning via Joint Parameter Selection

📅 2025-08-23
📈 Citations: 0
Influential: 0
📄 PDF

career value

222K/year
🤖 AI Summary
Fine-tuning pretrained models often degrades their domain generalization capability. To address this, we propose a joint parameter selection mechanism that updates only a sparse, cross-domain gradient-consistent subset of parameters during fine-tuning. Grounded in theoretical analysis linking parameter sparsity to generalization error, our method introduces a dual-operator gradient consistency filter to dynamically identify parameters with high generalization potential for update. This strategy integrates parameter-efficient fine-tuning with gradient-driven structured sparsity, preserving the model’s original domain generalization while enhancing task-specific adaptation. Extensive experiments on multiple standard domain generalization benchmarks demonstrate significant improvements over existing state-of-the-art methods, validating the effectiveness of our approach in balancing generalization and adaptability.

Technology Category

Application Category

📝 Abstract
Domain generalization seeks to develop models trained on a limited set of source domains that are capable of generalizing effectively to unseen target domains. While the predominant approach leverages large-scale pre-trained vision models as initialization, recent studies have highlighted that full fine-tuning can compromise the intrinsic generalization capabilities of these models. To address this limitation, parameter-efficient adaptation strategies have emerged, wherein only a subset of model parameters is selectively fine-tuned, thereby balancing task adaptation with the preservation of generalization. Motivated by this paradigm, we introduce Joint Parameter Selection (JPS), a novel method that restricts updates to a small, sparse subset of parameters, thereby retaining and harnessing the generalization strength of pre-trained models. Theoretically, we establish a generalization error bound that explicitly accounts for the sparsity of parameter updates, thereby providing a principled justification for selective fine-tuning. Practically, we design a selection mechanism employing dual operators to identify and update parameters exhibiting consistent and significant gradients across all source domains. Extensive benchmark experiments demonstrate that JPS achieves superior performance compared to state-of-the-art domain generalization methods, substantiating both the efficiency and efficacy of the proposed approach.
Problem

Research questions and friction points this paper is trying to address.

Preserve generalization in fine-tuning pre-trained models
Select sparse parameter subset for efficient adaptation
Balance task adaptation with domain generalization capability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Joint Parameter Selection method
Sparse parameter updates strategy
Dual operators gradient mechanism
🔎 Similar Papers
No similar papers found.
Bin Pan
Bin Pan
Nankai University
Remote Sensing
S
Shiyu Shen
School of Statistics and Data Science, KLMDASR, LEBPS, and LPMC, Nankai University, Tianjin 300071, China
Z
Zongbin Wang
School of Statistics and Data Science, KLMDASR, LEBPS, and LPMC, Nankai University, Tianjin 300071, China
Z
Zhenwei Shi
Image Processing Center, School of Astronautics, Beihang University, Beijing 100191, China
X
Xia Xu
School of Computer Science and Technology, Tiangong University, Tianjin 300387, China