MoRe Fine-Tuning with 10x Fewer Parameters

📅 2024-08-30
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing parameter-efficient fine-tuning (PEFT) methods such as LoRA rely on heuristic adapter architectures, which can limit generalization and transferability across models. Method: We propose a learnable rectangular adapter search framework grounded in Monarch matrices, the first application of the Monarch structure to PEFT, supported by theoretical analysis showing greater expressivity than LoRA. The approach uses differentiable neural architecture search to automatically discover lightweight adapter topologies, eliminating manual specification of rank or module shape, and couples the low-parameter adapter design with efficient fine-tuning strategies. Contribution/Results: On multi-task and multi-model benchmarks, the method matches or outperforms state-of-the-art PEFT approaches while using as few as 5% of LoRA's parameters, delivering strong cross-task and cross-model generalization alongside exceptional parameter efficiency.

📝 Abstract
Parameter-efficient fine-tuning (PEFT) techniques have unlocked the potential to cheaply and easily specialize large pretrained models. However, the most prominent approaches, like low-rank adapters (LoRA), depend on heuristics or rules-of-thumb for their architectural choices -- potentially limiting their performance for new models and architectures. This limitation suggests that techniques from neural architecture search could be used to obtain optimal adapter architectures, but these are often expensive and difficult to implement. We address this challenge with Monarch Rectangular Fine-tuning (MoRe), a simple framework to search over adapter architectures that relies on the Monarch matrix class. Theoretically, we show that MoRe is more expressive than LoRA. Empirically, our approach is more parameter-efficient and performant than state-of-the-art PEFTs on a range of tasks and models, with as few as 5% of LoRA's parameters.
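To make the abstract's core idea concrete, the sketch below builds a Monarch-style weight update from two block-diagonal factors joined by a fixed stride permutation, and compares its parameter count to a LoRA update of the same size. This is a minimal illustration of the Monarch matrix class, not the paper's exact MoRe parameterization (the block sizes, permutation, and rectangular shapes MoRe searches over are assumptions here); it shows why such an update can reach full rank while storing far fewer parameters than a dense matrix.

```python
import numpy as np

def monarch_delta(d, nblocks, rng):
    """Illustrative Monarch-style update: Delta W = P^T B P A, where A and B
    are block-diagonal and P is a fixed stride permutation. A sketch of the
    Monarch matrix class, not MoRe's exact construction."""
    bs = d // nblocks
    # Two block-diagonal factors, each holding nblocks dense (bs x bs) blocks.
    A = np.zeros((d, d))
    B = np.zeros((d, d))
    for i in range(nblocks):
        s = slice(i * bs, (i + 1) * bs)
        A[s, s] = rng.standard_normal((bs, bs))
        B[s, s] = rng.standard_normal((bs, bs))
    # Stride permutation that mixes information across the blocks,
    # so the product is not itself block-diagonal.
    P = np.eye(d)[np.arange(d).reshape(nblocks, bs).T.ravel()]
    return P.T @ B @ P @ A

rng = np.random.default_rng(0)
d, nblocks = 64, 8
delta = monarch_delta(d, nblocks, rng)

# Trainable-parameter counts: only the block entries are stored.
monarch_params = 2 * d * (d // nblocks)  # 2 factors x nblocks x bs^2 = 1024
lora_params = 2 * d * 16                 # rank-16 LoRA (B @ A) = 2048

# With generic blocks the Monarch product is full rank (rank d),
# whereas a rank-r LoRA update is capped at rank r.
print(delta.shape, np.linalg.matrix_rank(delta), monarch_params, lora_params)
```

The rank comparison mirrors the expressivity claim: a rank-16 LoRA update on a 64-dimensional layer can never exceed rank 16, while the block-diagonal product above is generically full rank with half the parameters.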
Problem

Research questions and friction points this paper is trying to address.

Optimizing adapter architectures for parameter-efficient fine-tuning
Reducing reliance on heuristics in low-rank adapters (LoRA)
Enhancing performance with fewer parameters in PEFT techniques
Innovation

Methods, ideas, or system contributions that make the work stand out.

Monarch Rectangular Fine-tuning (MoRe) framework
Searches over adapter architectures efficiently
More expressive and parameter-efficient than LoRA