FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA

📅 2025-05-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In differentially private federated learning (DP-FL), LoRA fine-tuning suffers from quadratic noise amplification due to DP-SGD’s per-sample gradient perturbation followed by BA matrix multiplication; freezing the A matrix mitigates noise but impairs representational capacity. To address this, we propose an SVD-based global reparameterization mechanism: clients upload only the low-rank matrix B, while the server dynamically reconstructs an orthogonal A matrix via adaptive singular value decomposition (SVD). This is the first work to integrate SVD reparameterization into private federated LoRA training. Our method preserves LoRA’s full expressivity while structurally suppressing noise amplification and provides a theoretical gradient norm bound. Experiments across multiple privacy budgets and benchmark tasks demonstrate significant improvements in model stability and accuracy, consistently outperforming baselines including frozen-A LoRA and standard LoRA.

Technology Category

Application Category

📝 Abstract
Low-Rank Adaptation (LoRA), which introduces a product of two trainable low-rank matrices into frozen pre-trained weights, is widely used for efficient fine-tuning of language models in federated learning (FL). However, when combined with differentially private stochastic gradient descent (DP-SGD), LoRA faces substantial noise amplification: DP-SGD perturbs per-sample gradients, and the matrix multiplication of the LoRA update ($BA$) intensifies this effect. Freezing one matrix (e.g., $A$) reduces the noise but restricts model expressiveness, often resulting in suboptimal adaptation. To address this, we propose FedSVD, a simple yet effective method that introduces a global reparameterization based on singular value decomposition (SVD). In our approach, each client optimizes only the $B$ matrix and transmits it to the server. The server aggregates the $B$ matrices, computes the product $BA$ using the previous $A$, and refactorizes the result via SVD. This yields a new adaptive $A$ composed of the orthonormal right singular vectors of $BA$, and an updated $B$ containing the remaining SVD components. This reparameterization avoids quadratic noise amplification, while allowing $A$ to better capture the principal directions of the aggregate updates. Moreover, the orthonormal structure of $A$ bounds the gradient norms of $B$ and preserves more signal under DP-SGD, as confirmed by our theoretical analysis. As a result, FedSVD consistently improves stability and performance across a variety of privacy settings and benchmarks, outperforming relevant baselines under both private and non-private regimes.
Problem

Research questions and friction points this paper is trying to address.

Reduces noise amplification in private federated LoRA training
Balances model expressiveness and privacy in DP-SGD
Improves adaptation via global SVD reparameterization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses SVD for global reparameterization in FL
Optimizes only B matrix to reduce noise
Orthonormal A captures principal update directions
🔎 Similar Papers
2024-05-22Neural Information Processing SystemsCitations: 6