Progtuning: Progressive Fine-tuning Framework for Transformer-based Language Models

📅 2025-06-26

🤖 AI Summary
Full-parameter fine-tuning of large language models is computationally expensive, and existing parameter-efficient fine-tuning (PEFT) methods still update parameters in every Transformer block throughout training, ignoring the unequal contribution of individual blocks. To address this, the paper proposes Progtuning, a progressive fine-tuning framework that reduces the number of updated Transformer blocks in stages according to their contribution: low-contribution blocks are progressively frozen while the remaining blocks continue to be updated. Progtuning is compatible with mainstream PEFT techniques and reduces the number of updated parameters by approximately 25% while maintaining competitive performance on downstream tasks. The result is a more efficient allocation of computing resources and flexible adaptation across fine-tuning scenarios.
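
The idea of keeping only the higher-contribution blocks trainable at each stage can be illustrated with a minimal sketch. The source does not say how contribution is measured, so the scores below are made-up placeholders, and `select_trainable_blocks` is a hypothetical helper, not the authors' selection mechanism.

```python
from typing import Dict, List


def select_trainable_blocks(scores: Dict[int, float], keep: int) -> List[int]:
    """Return the indices of the `keep` highest-scoring blocks, in index order."""
    if not 0 <= keep <= len(scores):
        raise ValueError("keep must be between 0 and the number of blocks")
    # Rank block indices from highest to lowest score.
    ranked = sorted(scores, key=lambda idx: scores[idx], reverse=True)
    return sorted(ranked[:keep])


# Made-up contribution scores for a 6-block model (placeholders, not measured).
example_scores = {0: 0.05, 1: 0.10, 2: 0.08, 3: 0.30, 4: 0.22, 5: 0.25}

# Shrink the trainable set stage by stage: 6 blocks, then 4, then 2.
stages = [select_trainable_blocks(example_scores, keep) for keep in (6, 4, 2)]
```

Blocks missing from a stage's list would be frozen for that stage; in a PyTorch model this typically means setting `requires_grad = False` on their parameters.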


📝 Abstract
Fine-tuning is a promising technique for leveraging Transformer-based language models in downstream tasks. As model sizes continue to grow, updating all model parameters becomes increasingly costly. Parameter-efficient fine-tuning methods address this issue by selectively updating a small subset of parameters. However, both full fine-tuning and most existing parameter-efficient fine-tuning methods keep the number of updated parameters fixed at its initial size throughout training, ignoring the unequal contribution of different Transformer blocks and leading to an inefficient allocation of computing resources. In this paper, we propose Progtuning, a novel fine-tuning framework that combines fine-tuning with progressive learning for Transformer-based language models. Specifically, Progtuning progressively reduces the number of updated Transformer blocks based on their contribution. Progtuning optimizes resource allocation and reduces the number of updated parameters by approximately 25% while maintaining competitive performance. It is also highly adaptable to parameter-efficient fine-tuning methods, performing well across various adaptation scenarios.
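
To make the staged reduction concrete, here is a small sketch of a progressive schedule and of the resulting average share of updated blocks. It rests on assumptions that are not in the abstract: blocks are frozen from the bottom up in equal steps, all stages have equal length, and all blocks are the same size. The function names are hypothetical.

```python
from typing import List


def progressive_schedule(num_blocks: int, num_stages: int) -> List[List[int]]:
    """Return, for each stage, the indices of blocks that stay trainable.

    Assumption (not from the source): blocks are frozen from the bottom up,
    in equal-sized steps, so the last stage still updates the top blocks.
    """
    if num_blocks < 1 or not 1 <= num_stages <= num_blocks:
        raise ValueError("need 1 <= num_stages <= num_blocks")
    step = num_blocks // num_stages
    schedule = []
    for stage in range(num_stages):
        first_trainable = stage * step  # blocks below this index are frozen
        schedule.append(list(range(first_trainable, num_blocks)))
    return schedule


def mean_updated_fraction(schedule: List[List[int]], num_blocks: int) -> float:
    """Average share of blocks updated per stage, assuming equal-length stages."""
    return sum(len(blocks) for blocks in schedule) / (len(schedule) * num_blocks)
```

For example, `mean_updated_fraction(progressive_schedule(12, 4), 12)` evaluates to 0.625, and with two stages it evaluates to 0.75. These numbers are purely illustrative: the abstract does not state the stage count or freezing order behind its reported reduction, and a share of blocks is only a proxy for a share of parameters.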
Problem

Research questions and friction points this paper is trying to address.

Optimize resource allocation in Transformer fine-tuning
Reduce updated parameters while maintaining performance
Improve efficiency of parameter-efficient fine-tuning methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Progressive reduction of updated transformer blocks
Reduces the number of updated parameters by approximately 25%
Compatible with parameter-efficient fine-tuning methods
Xiaoshuang Ji
Institute of Information Engineering, Chinese Academy of Sciences
Zhendong Zhao
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; Key Laboratory of Cyberspace Security Defense, Beijing, China
Xiaojun Chen
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; Key Laboratory of Cyberspace Security Defense, Beijing, China
Xin Zhao
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of Cyberspace Security Defense, Beijing, China
Zeyao Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of Cyberspace Security Defense, Beijing, China