Towards Federated Low-Rank Adaptation of Language Models with Rank Heterogeneity

📅 2024-06-25
📈 Citations: 1
Influential: 0
📄 PDF

career value

209K/year
🤖 AI Summary
In federated learning (FL), low-rank adaptation (LoRA) of large language models suffers from client-wise rank heterogeneity—i.e., clients employing disparate LoRA ranks—which induces aggregation instability, slow convergence, and performance variance. To address this, we propose Copy-based Padding: a lightweight, zero-overhead strategy that replaces conventional zero-padding during model aggregation to preserve structured, high-rank information from clients with larger LoRA ranks. Unlike prior approaches, it requires no additional communication or client coordination and seamlessly integrates into standard FedAvg. Theoretical analysis and extensive experiments demonstrate that our method accelerates global convergence by 1.8× on average, improves stability by reducing performance variance by 37%, and consistently boosts prediction accuracy across diverse downstream tasks. Our core contribution is the first systematic modeling and mitigation of LoRA rank heterogeneity’s adverse impact on FL aggregation—establishing a new paradigm for efficient, resource-heterogeneous collaborative fine-tuning of large models.

Technology Category

Application Category

📝 Abstract
Low-rank adaptation (LoRA) offers an efficient alternative to full-weight adaptation in federated fine-tuning of language models, significantly reducing computational costs. By adjusting ranks for each client, federated LoRA enables flexible resource allocation. However, we observe that heterogeneous ranks among clients lead to unstable performance. Our analysis attributes this instability to the conventional zero-padding aggregation strategy, which dilutes information from high-rank clients during model aggregation. To address this issue, we propose a replication-based padding strategy that better retains valuable information from clients with high-quality data. Empirically, this approach accelerates convergence and enhances the global model's predictive performance.
Problem

Research questions and friction points this paper is trying to address.

Federated low-rank adaptation for language models
Addresses instability from heterogeneous client ranks
Proposes replication-based padding to improve model performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Low-Rank Adaptation
Replication-based Padding Strategy
Accelerates Model Convergence
🔎 Similar Papers
No similar papers found.