Enhancing Few-Shot Transfer Learning with Optimized Multi-Task Prompt Tuning through Modular Prompt Composition

📅 2024-08-23

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

178K/year

🤖 AI Summary

To address the weak knowledge transfer capability of prompt tuning in multi-task few-shot learning, this paper proposes a modular prompt composition framework. It decouples task-specific prompts into cross-task shareable source prompts and task-exclusive prompts, enabling flexible multi-source fusion via weighted combination, gating mechanisms, and attention. This disentangled design is the first to systematically support efficient parameter reuse and collaborative fine-tuning of prompts. Evaluated on GLUE under few-shot settings, it achieves substantial gains in accuracy and robustness, surpassing existing prompt tuning methods with minimal labeled data and attaining state-of-the-art performance. The core contribution lies in uncovering and modeling the hierarchical transferable structure inherent in prompts, thereby establishing a novel paradigm for parameter-efficient multi-task learning.

Technology Category

Application Category

📝 Abstract

In recent years, multi-task prompt tuning has garnered considerable attention for its inherent modularity and potential to enhance parameter-efficient transfer learning across diverse tasks. This paper aims to analyze and improve the performance of multiple tasks by facilitating the transfer of knowledge between their corresponding prompts in a multi-task setting. Our proposed approach decomposes the prompt for each target task into a combination of shared prompts (source prompts) and a task-specific prompt (private prompt). During training, the source prompts undergo fine-tuning and are integrated with the private prompt to drive the target prompt for each task. We present and compare multiple methods for combining source prompts to construct the target prompt, analyzing the roles of both source and private prompts within each method. We investigate their contributions to task performance and offer flexible, adjustable configurations based on these insights to optimize performance. Our empirical findings clearly showcase improvements in accuracy and robustness compared to the conventional practice of prompt tuning and related works. Notably, our results substantially outperform other methods in the field in few-shot settings, demonstrating superior performance in various tasks across GLUE benchmark, among other tasks. This achievement is attained with a significantly reduced amount of training data, making our method a promising one for few-shot settings.

Problem

Research questions and friction points this paper is trying to address.

Optimizing multi-task prompt tuning for knowledge transfer

Decomposing prompts into shared and task-specific components

Improving few-shot learning accuracy with reduced training data

Innovation

Methods, ideas, or system contributions that make the work stand out.

Modular prompt composition for knowledge transfer

Combining shared and task-specific prompts adaptively

Optimized multi-task tuning with reduced training data

🔎 Similar Papers

No similar papers found.