Towards Federated Low-Rank Adaptation of Language Models with Rank Heterogeneity

📅 2024-06-25

📈 Citations: 1

✨ Influential: 0

career value

180K/year

🤖 AI Summary

In federated learning (FL), low-rank adaptation (LoRA) of large language models suffers from client-wise rank heterogeneity—i.e., clients employing disparate LoRA ranks—which induces aggregation instability, slow convergence, and performance variance. To address this, we propose Copy-based Padding: a lightweight, zero-overhead strategy that replaces conventional zero-padding during model aggregation to preserve structured, high-rank information from clients with larger LoRA ranks. Unlike prior approaches, it requires no additional communication or client coordination and seamlessly integrates into standard FedAvg. Theoretical analysis and extensive experiments demonstrate that our method accelerates global convergence by 1.8× on average, improves stability by reducing performance variance by 37%, and consistently boosts prediction accuracy across diverse downstream tasks. Our core contribution is the first systematic modeling and mitigation of LoRA rank heterogeneity’s adverse impact on FL aggregation—establishing a new paradigm for efficient, resource-heterogeneous collaborative fine-tuning of large models.

Technology Category

Application Category

📝 Abstract

Low-rank adaptation (LoRA) offers an efficient alternative to full-weight adaptation in federated fine-tuning of language models, significantly reducing computational costs. By adjusting ranks for each client, federated LoRA enables flexible resource allocation. However, we observe that heterogeneous ranks among clients lead to unstable performance. Our analysis attributes this instability to the conventional zero-padding aggregation strategy, which dilutes information from high-rank clients during model aggregation. To address this issue, we propose a replication-based padding strategy that better retains valuable information from clients with high-quality data. Empirically, this approach accelerates convergence and enhances the global model's predictive performance.

Problem

Research questions and friction points this paper is trying to address.

Federated low-rank adaptation for language models

Addresses instability from heterogeneous client ranks

Proposes replication-based padding to improve model performance

Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Low-Rank Adaptation

Replication-based Padding Strategy

Accelerates Model Convergence

🔎 Similar Papers

No similar papers found.

💼 Related Jobs

PhD GenAI Research Scientist Intern

Databricks

SF Bay Area Hourly Rate$54—$60 USD

San Francisco, CA, USA

Research Engineer, Monetization AI